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Preface 


Since Richard Feynman’s death in 1988 it has become increasingly 
evident that he was one of the most brilliant and original theoreti- 
cal physicists of the twentieth century.! The Nobel Prize in Physics 
for 1965, shared with Julian Schwinger and Sin-itiro Tomonaga, re- 
warded their independent path-breaking work on the renormaliza- 
tion theory of quantum electrodynamics (QED). Feynman based his 
own formulation of a consistent QED, free of meaningless infinities, 
upon the work in his doctoral thesis of 1942 at Princeton Univer- 
sity, which is published here for the first time. His new approach to 
quantum theory made use of the Principle of Least Action and led 
to methods for the very accurate calculation of quantum electromag- 
netic processes, as amply confirmed by experiment. These methods 
rely on the famous “Feynman diagrams,” derived originally from the 
path integrals, which fill the pages of many articles and textbooks. 
Applied first to QED, the diagrams and the renormalization pro- 
cedure based upon them also play a major role in other quantum 
field theories, including quantum gravity and the current “Standard 
Model” of elementary particle physics. The latter theory involves 
quarks and leptons interacting through the exchange of renormaliz- 
able Yang-Mills non-Abelian gauge fields (the electroweak and color 
gluon fields). 

The path-integral and diagrammatic methods of Feynman are im- 
portant general techniques of mathematical physics that have many 
applications other than quantum field theories: atomic and molecu- 
lar scattering, condensed matter physics, statistical mechanics, quan- 
tum liquids and solids, Brownian motion, noise, etc.? In addition to 


! Hans Bethe’s obituary of Feynman [Nature 332 (1988), p. 588] begins: “Richard P. 
Feynman was one of the greatest physicists since the Second World War and, I believe, 
the most original.” 

2 Some of these topics are treated in R. P. Feynman and A. R. Hibbs, Quantum 
Mechanics and Path Integrals (McGraw-Hill, Massachusetts, 1965). Also see M. C. 
Gutzwiller, “Resource Letter ICQM-1: The Interplay Between Classical and Quantum 
Mechanics,” Am. J. Phys. 66 (1998), pp. 304-24; items 71-73 and 158-168 deal with 
path integrals. 
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its usefulness in these diverse fields of physics, the path-integral ap- 
proach brings a new fundamental understanding of quantum theory. 
Dirac, in his transformation theory, demonstrated the complementar- 
ity of two seemingly different formulations: the matrix mechanics of 
Heisenberg, Born, and Jordan and the wave mechanics of de Broglie 
and Schrodinger. Feynman’s independent path-integral theory sheds 
new light on Dirac’s operators and Schrédinger’s wave functions, and 
inspires some novel approaches to the still somewhat mysterious in- 
terpretation of quantum theory. Feynman liked to emphasize the 
value of approaching old problems in a new way, even if there were 
to be no immediate practical benefit. 


Early Ideas on Electromagnetic Fields 


Growing up and educated in New York City, where he was born 
on 11 May 1918, Feynman did his undergraduate studies at the 
Massachusetts Institute of Technology (MIT), graduating in 1939. 
Although an exceptional student with recognized mathematical 
prowess, he was not a prodigy like Julian Schwinger, his fellow New 
Yorker born the same year, who received his PhD in Physics from 
Columbia University in 1939 and had already published fifteen arti- 
cles. Feynman had two publications at MIT, including his undergrad- 
uate thesis with John C. Slater on “Forces and Stresses in Molecules.” 
In that work he proved a very important theorem in molecular and 
solid-state physics, which is now known as the Hellmann—Feynman 
theorem.® 

While still an undergraduate at MIT, as he related in his Nobel 
address, Feynman devoted much thought to electromagnetic inter- 
actions, especially the self-interaction of a charge with its own field, 
which predicted that a pointlike electron would have an infinite mass. 
This unfortunate result could be avoided in classical physics, either 
by not calculating the mass, or by giving the theoretical electron an 


3 L. M. Brown (ed.), Selected Papers of Richard Feynman, with Commentary (World 
Scientific, Singapore, 2000), p. 3. This volume (hereafter referred to as SP) includes a 
complete bibliography of Feynman’s work. 
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extended structure; the latter choice makes for some difficulties in 
relativistic physics. 

Neither of these solutions are possible in QED, however, because 
the extended electron gives rise to non-local interaction and the in- 
finite pointlike mass inevitably contaminates other effects, such as 
atomic energy level differences, when calculated to high accuracy. 
While at MIT, Feynman thought that he had found a simple solu- 
tion to this problem: Why not assume that the electron does not 
experience any interaction with its own electromagnetic field? When 
he began his graduate study at Princeton University, he carried this 
idea with him. He explained why in his Nobel Address:* 


Well, it seemed to me quite evident that the idea that a 
particle acts on itself is not a necessary one — it is a sort of 
silly one, as a matter of fact. And so I suggested to myself 
that electrons cannot act on themselves; they can only act on 
other electrons. That means there is no field at all. There 
was a direct interaction between charges, albeit with a delay. 


A new classical electromagnetic field theory of that type would 
avoid such difficulties as the infinite self-energy of the point electron. 
The very useful notion of a field could be retained as an auxiliary 
concept, even if not thought to be a fundamental one. There was 
a chance also that if the new theory were quantized, it might elim- 
inate the fatal problems of the then current QED. However, Feyn- 
man soon learned that there was a great obstacle to this delayed 
action-at-a-distance theory: namely, if a radiating electron, say in 
an atom or an antenna, were not acted upon at all by the field that 
it radiated, then it would not recoil, which would violate the conser- 
vation of energy. For that reason, some form of radiative reaction is 
necessary. 


4 SP, pp. 9-32, especially p. 10. 
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The Wheeler—Feynman Theory 


Trying to work through this problem at Princeton, Feynman 
asked his future thesis adviser, the young Assistant Professor John 
Wheeler, for help. In particular, he asked whether it was possible 
to consider that two charges interact in such a way that the second 
charge, accelerated by absorbing the radiation emitted by the first 
charge, itself emits radiation that reacts upon the first. Wheeler 
pointed out that there would be such an effect but, delayed by the 
time required for light to pass between the two particles, it could not 
be the force of radiation reaction, which is instantaneous; also the 
force would be much too weak. What Feynman had suggested was 
not radiation reaction, but the reflection of light! 

However, Wheeler did offer a possible way out of the difficulty. 
First, one could assume that radiation always takes place in a to- 
tally absorbing universe, like a room with the blinds drawn. Second, 
although the principle of causality states that all observable effects 
take place at a time later than the cause, Maxwell’s equations for 
the electromagnetic field possess a radiative solution other than that 
normally adopted, which is delayed in time by the finite velocity of 
light. In addition, there is a solution whose effects are advanced in 
time by the same amount. A linear combination of retarded and 
advanced solutions can also be used, and Wheeler asked Feynman to 
investigate whether some suitable combination in an absorbing uni- 
verse would provide the required observed instantaneous radiative 
reaction? 

Feynman worked out Wheeler’s suggestion and found that, in- 
deed, a mixture of one-half advanced and one-half retarded inter- 
action in an absorbing universe would exactly mimic the result of 
a radiative reaction due to the electron’s own field emitting purely 
retarded radiation. The advanced part of the interaction would stim- 
ulate a response in the electrons of the absorber, and their effect at 
the source (summed over the whole absorber) would arrive at just 
the right time and in the right strength to give the required radia- 
tion reaction force, without assuming any direct interaction of the 
electron with its own radiation field. Furthermore, no apparent 
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violation of the principle of causality arises from the use of advanced 
radiation. Wheeler and Feynman further explored this beautiful the- 
ory in articles published in the Reviews of Modern Physics (RMP) 
in 1945 and 1949.° In the first of these articles, no less than four 
different proofs are presented of the important result concerning the 
radiative reaction. 


Quantizing the Wheeler-Feynman theory (Feynman’s 
PhD thesis): The Principle of Least Action in 
Quantum Mechanics 


Having an action-at-a-distance classical theory of electromagnetic 
interactions without fields, except as an auxiliary device, the ques- 
tion arises as to how to make a corresponding quantum theory. 
To treat a classical system of interacting particles, there are avail- 
able analytic methods using generalized coordinates, developed by 
Hamilton and Lagrange, corresponding canonical transformations, 
and the principle of least action.© The original forms of quantum 
mechanics, due to Heisenberg, Schrédinger, and Dirac, made use 
of the Hamiltonian approach and its consequences, especially Pois- 
son brackets. To quantize the electromagnetic field it was repre- 
sented, by Fourier transformation, as a superposition of plane waves 
having transverse, longitudinal, and timelike polarizations. A given 
field was represented as mathematically equivalent to a collection of 
harmonic oscillators. A system of interacting particles was then de- 
scribed by a Hamiltonian function of three terms representing respec- 
tively the particles, the field, and their interaction. Quantization con- 
sisted of regarding these terms as Hamiltonian operators, the field’s 
Hamiltonian describing a suitable infinite set of quantized harmonic 
oscillators. The combination of longitudinal and timelike oscillators 


5 


° SP, p. 35-59 and p. 60-68. The second paper was actually written by Wheeler, based 
upon the joint work of both authors. It is remarked in these papers that H. Tetrode, 
W. Ritz, and G. N. Lewis had independently anticipated the absorber idea. 

6 W. Yourgrau and S. Mandelstam give an excellent analytic historical account in 
Variational Principles in Dynamics and Quantum Theory (Saunders, Philadelphia, 3rd 
edn., 1968). 
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was shown to provide the (instantaneous) Coulomb interaction of the 
particles, while the transverse oscillators were equivalent to photons. 
This approach, as well as the more general approach adopted by 
Heisenberg and Pauli (1929), was based upon Bohr’s correspondence 
principle. 

However, no method based upon the Hamiltonian could be used 
for the Wheeler—Feynman theory, either classically or quantum me- 
chanically. The principal reason was the use of half-advanced and 
half-retarded interaction. The Hamiltonian method describes and 
keeps track of the state of the system of particles and fields at a 
given time. In the new theory, there are no field variables, and ev- 
ery radiative process depends on contributions from the future as 
well as from the past! One is forced to view the entire process from 
start to finish. The only existing classical approach of this kind for 
particles makes use of the principle of least action, and Feynman’s 
thesis project was to develop and generalize this approach so that it 
could be used to formulate the Wheeler-Feynman theory (a theory 
possessing an action, but without a Hamiltonian). If successful, he 
should then try to find a method to quantize the new theory.’ 


The Introduction to the Thesis 


Presenting his motivation and giving the plan of the thesis, 
Feynman’s introductory section laid out the principal features of 
the (not yet published) delayed electromagnetic action-at-a-distance 
theory as described above, including the postulate that “fundamen- 
tal (microscopic) phenomena in nature are symmetrical with respect 
to the interchange of past and future.” Feynman claimed: “This 
requires that the solution of Maxwell’s equation|s] to be used in 
computing the interactions is to be half the retarded plus half the 
advanced solution of Lienard and Wiechert.” Although it would ap- 
pear to contradict causality, Feynman stated that the principles of 


7 For a related discussion, including Feynman’s PhD thesis, see S. S. Schweber, QED 


and the Men Who Made It: Dyson, Feynman, Schwinger, and Tomonaga (Princeton 
University Press, Princeton, 1994), especially pp. 389-397. 
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the theory “do in fact lead to essential agreement with the results 
of the more usual form of electrodynamics, and at the same time 
permit a consistent description of point charges and lead to a unique 
law of radiative damping .... It is shown that these principles are 
equivalent to the equations of motion resulting from a principle of 
least action.” 

To explain the spontaneous decay of excited atoms and the ex- 
istence of photons, both seemingly contradicted by this view, Feyn- 
man argued that “an atom alone in empty space would, in fact, not 
radiate ... and all of the apparent quantum properties of light and 
the existence of photons may be nothing more than the result of 
matter interacting with matter directly, and according to quantum 
mechanical laws.” 

Two important points conclude the introduction. First, although 
the Wheeler—Feynman theory clearly furnished its motivation: “It is 
to be emphasized ... that the work described here is complete in itself 
without regard to its application to electrodynamics ... [The] present 
paper is concerned with the problem of finding a quantum mechanical 
description applicable to systems which in their classical analogue 
are expressible by a principle of least action, and not necessarily by 
Hamiltonian equations of motion.” The second point is this: “All of 
the analysis will apply to non-relativistic systems. The generalization 
to the relativistic case is not at present known.” 


Classical Dynamics Generalized 


The second section of the thesis discusses the theory of functionals 
and functional derivatives, and it generalizes the principle of least 
action of classical dynamics. Applying this method to the partic- 
ular example of particles interacting through the intermediary of 
classical harmonic oscillators (an analogue of the electromagnetic 
field), Feynman shows how the coordinates of the oscillators can be 
eliminated and how their role in the interaction is replaced by a direct 
delayed interaction of the particles. Before this elimination process, 
the system consisting of oscillators and particles possesses a Hamil- 
tonian but afterward, when the particles have direct interaction, no 
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Hamiltonian formulation is possible. Nevertheless, the equations of 
motion can still be derived from the principle of least action. This 
demonstration sets the stage for a similar procedure to be carried 
out in the quantized theory developed in the third and final section 
of the thesis. 

In classical dynamics, the action is given by 


s= / L(q(t), q(t))dt, 


where L is a function of the generalized coordinates q(t) and the 
generalized velocities ¢ = dq/dt, the integral being taken between 
the initial and final times tg and t,, for which the set of q’s have 
assigned values. The action depends on the paths q(t) taken by the 
particles, and thus it is a functional of those paths. The principle 
of least action states that for “small” variations of the paths, the 
end points being fixed, the action S is an extremum, in most cases a 
minimum. An equivalent statement is that the functional derivative 
of S is zero. In the usual treatment, this principle leads to the 
Lagrangian and Hamiltonian equations of motion. 

Feynman illustrates how this principle can be extended to the 
case of a particle (perhaps an atom) interacting with itself through 
advanced and retarded waves, by means of a mirror. An interaction 
term of the form k?#(t)4(t + T) is added to the Lagrangian of the 
particle in the action integral, T’ being the time for light to reach 
the mirror and return to the particle. (As an approximation, the 
limits of integration of the action integral are taken as negative and 
positive infinity.) A simple calculation, setting the variation of the 
action equal to zero, leads to the equation of motion of the particle. 
This shows that the force on the particle at time ¢ depends on the 
particle’s motion at times t, t— T, andt+7. That leads Feynman 
to observe: “The equations of motion cannot be described directly 
in Hamiltonian form.” 

After this simple example, there is a section discussing the re- 
strictions that are needed to guarantee the existence of the usual 
constants of motion, including the energy. The thesis then treats 
the more complicated case of particles interacting via intermediate 


oscillators. 
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XV 


It is shown how to eliminate the oscillators and obtain 


direct delayed action-at-a-distance. Interestingly, by making a suit- 


able choice of the action functional, one can obtain particles either 
with or without self-interaction. 
While still working on formulating the classical Wheeler—Feynman 
theory, Feynman was already beginning to adopt the over-all space- 


time approach that characterizes the quantization carried out in the 


thesis and in so much of his subsequent work, as he explained in his 
Nobel Lecture:® 
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By this time I was becoming used to a physical point of 
view different from the more customary point of view. In the 
customary view, things are discussed as a function of time 
in very great detail. For example, you have the field at this 
moment, a different equation gives you the field at a later 
moment and so on; a method, which I shall call the Hamii- 
tonian method, a time differential method. We have, instead 
[the action] a thing that describes the character of the path 
throughout all of space and time. The behavior of nature is 
determined by saying her whole space-time path has a certain 
character. For the action [with advanced and retarded terms] 
the equations are no longer at all easy to get back into Hamil- 
tonian form. If you wish to use as variables only the coordi- 
nates of particles, then you can talk about the property of the 
paths — but the path of one particle at a given time is affected 
by the path of another at a different time... . Therefore, you 
need a lot of bookkeeping variables to keep track of what the 
particle did in the past. These are called field variables .... 
From the overall space-time point of view of the least 
action principle, the field disappears as nothing but bookkeep- 
ing variables insisted on by the Hamiltonian method. 


Of the many significant contribution to theoretical physics that 
Feynman made throughout his career, perhaps none will turn out to 


“The development of the space-time view of quantum electrodynamics,” 


pp. 9-32, especially p. 16. 


SP, 
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be of more lasting value than his reformulation of quantum mechan- 
ics, complementing those of Heisenberg, Schrédinger, and Dirac.? 
When extended to the relativistic domain and including the quan- 
tized electromagnetic field, it forms the basis of Feynman’s version of 
QED, which is now the version of choice of theoretical physics, and 
which was seminal in the development of the gauge theories employed 
in the Standard Model of particle physics. !° 


Quantum Mechanics and the Principle of Least Action 


The third and final section of the thesis, together with the RMP 
article of 1949, presents the new form of quantum mechanics.!! In 
reply to a request for a copy of the thesis, Feynman said he had not 
an available copy, but instead sent a reprint of the RMP article, with 
this explanation of the difference: !? 


This article contains most of what was in the thesis. The 
thesis contained in addition a discussion of the relation be- 
tween constants of motion such as energy and momentum 
and invariance properties of an action functional. Further 
there is a much more thorough discussion of the possible gen- 


9 The action principle approach was later adopted also by Julian Schwinger. In dis- 


cussing these formulations, Yourgrau and Mandelstam comment: “One cannot fail to 
observe that Feynman’s principle in particular — and this is no hyperbole — expresses 
the laws of quantum mechanics in an exemplary neat and elegant manner, notwith- 
standing the fact that it employs somewhat unconventional mathematics. It can easily 
be related to Schwinger’s principle, which utilizes mathematics of a more familiar na- 
ture. The theorem of Schwinger is, as it were, simply a translation of that of Feynman 
into differential notation.” (Taken from Yourgrau and Mandelstam’s book [footnote 6], 
p. 128.) 

10 Although it had initially motivated his approach to QED, Feynman found later that 
the quantized version of the Wheeler-Feynman theory (that is, QED without fields) could 
not account for the experimentally observed phenomenon known as vacuum polarization. 
Thus in a letter to Wheeler (on May 4, 1951) Feynman wrote: “I wish to deny the 
correctness of the assumption that electrons act only on other electrons ... . So I think 
we guessed wrong in 1941. Do you agree?” 

11_R. P. Feynman, “Space-time approach to non-relativistic quantum mechanics,” Rev. 
Mod. Phys. 20 (1948) pp. 367-387 included here as an appendix. Also in SP, pp. 177— 
197. 

12 Letter to J. G. Valatin, May 11, 1949. 


Preface xvii 


eralization of quantum mechanics to apply to more general 
functionals than appears in the Review article. Finally the 
properties of a system interacting through intermediate har- 
monic oscillators is discussed in more detail. 


The introductory part of this third section of the thesis refers 
to Dirac’s classical treatise for the usual formulation of quantum 
mechanics. !% 

However, Feynman writes that for those classical systems, which 
have no Hamiltonian form “no satisfactory method of quantization 
has been given.” Thus he intends to provide one, based on the prin- 
ciple of least action. He will show that this method satisfies two 
necessary criteria: First, in the limit that fA approaches zero, the 
quantum mechanical equations derived approach the classical ones, 
including the extended ones considered earlier. Second, for a system 
whose classical analogue does possess a Hamiltonian, the results are 
completely equivalent to the usual quantum mechanics. 

The next section, “The Lagrangian in Quantum Mechanics” has 
the same title as an article of Dirac, published in 1933.'4 Dirac 
presents there an alternative version to a quantum mechanics based 
on the classical Hamiltonian, which is a function of the coordinates 
q and the momenta p of the system. He remarks that the La- 
grangian, a function of coordinates and velocities, is more funda- 
mental because the action defined by it is a relativistic invariant, 
and also because it admits a principle of least action. Furthermore, 
it is “closely connected to the theory of contact transformations,” 
which has an important quantum mechanical analogue, namely, the 
transformation matrix (q|qr). This matrix connects a representation 
with the variables q diagonal at time T with a representation having 
the q’s diagonal at time t. In the article, Dirac writes that (q:\q¢r) 


13 Pp. A. M. Dirac, The Principles of Quantum Mechanics (Oxford University Press, 
Oxford, 2nd edn., 1935). Later editions contain very similar material regarding the 
fundamental aspects to which Feynman refers. 

14 Pp. A.M. Dirac, in Physikalische Zeitschrift der Sowjetunion, Band 3, Heft 1 (1933), 
included here as an appendix. In discussing this material, Feynman includes a lengthy 
quotation from Dirac’s Principles, 2nd edn., pp. 124-126. 
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“corresponds to” the quantity A(tT), defined as 


A(tT) = exp [ - Lat 


A bit later on, he writes that A(tr) “is the classical analogue of 
(alar).” 

When Herbert Jehle, who was visiting Princeton in 1941, called 
Feynman’s attention to Dirac’s article, he realized at once that it 
gave a necessary clue, based upon the principle of least action that he 
could use to quantize classical systems that do not possess a Hamil- 
tonian. Dirac’s paper argues that the classical limit condition for 
h approaching zero is satisfied, and Feynman shows this explicitly 
in his thesis. The procedure is to divide the time interval t — T 
into a large number of small elements and consider a succession of 
transformations from one time to the next: 


(alar) = [fatten an al nt}don—a +> (aalan) daa (anlar). 


If the transformation function has a form like A(tT), then the in- 
tegrand is a rapidly oscillating function when fh is small, and only 
those paths (qr, q1,42,---;4:) give an appreciable contribution for 
which the phase of the exponential is stationary. In the limit, only 
those paths are allowed for which the action is a minimum; i.e., for 


which 6S = 0, with 
t 
S= | Ldt. 
T 


For a very small time interval ¢, the transformation function takes 
the form 


A(t,t + ¢) = expiLe/h, 


where L = L((Q — q)/e,Q), and we have let ¢ = q and Q = qe. 
Applying the transformation function to the wave function w(q, t) 
to obtain 7(Q,t + ¢) and expanding the resulting integral equation 
to first order in ¢, Feynman obtains the Schrédinger equation. His 
derivation is valid for any Lagrangian containing at most quadratic 
terms in the velocities. In this way he demonstrates two important 
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points: In the first place, the derivation shows that the usual results 
of quantum mechanics are obtained for systems possessing a classical 
Lagrangian from which a Hamiltonian can be derived. Second, he 
shows that Dirac’s A(tT’) is not merely an analogue of (q:|qr), but 
is equal to it, for a small time ¢, up to a normalization factor. For a 
single coordinate, this factor is N = \/2miceh/m. 

This method turns out to be an extraordinarily powerful way to 
obtain Feynman’s path-integral formulation of quantum mechanics, 
upon which much of his subsequent thinking and production was 
based. Successive application of infinitesimal transformations pro- 
vides a transformation of the wave function over a finite time inter- 
val, say from time T to time t. The Lagrangian in the exponent can 
be approximated to first order in ¢, and 


wane ff-- foo} gS [2 (F=Poon) tor] 


90490 ° ++ /Imddm 
N(ty —to)--: N(T —tm)’ 


x (qo, to) 


is the result obtained by induction, where Q = gdm4i,T = tm4i, and 
the N’s are the normalization factors (one for each q) referred to 
sete In the limit where € goes to zero, the right-hand side is equal 

aa T). Feynman writes: “The sum in the exponential resembles 
eae (q, @)dt with the integral written as a Riemann sum. In a similar 
manner we can compute w(qo, to) in terms of the wave function at a 
later time ...” 

A sequence of q’s for each ¢; will, in the limit, define a path of the 
system and each of the integrals is to be taken over the entire range 
available to each q;. In other words, the multiple integral is taken 
over all possible paths. We note that each path is continuous but 
not, in general, differentiable. 

Using the idea of path integrals as in the expression above for 
v(Q,T), Feynman considers expressions at a given time to, such 

s (f(qo)) = (x\f(qo)|¥), which represents a quantum mechanical 
matrix element if y and W are different state functions or an expec- 
tation value if they represent the same state (ie., x = W*). Path 
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integrals relate the wave function ~(qo, to) to an earlier time and the 
wave function y(qo,to) to a later time, which are taken as the dis- 
tant past and future, respectively. By writing (f(qo)) at two times 
separated by ¢ and letting ¢ approach zero, Feynman shows how to 
calculate the time derivative of (f(q:)). 

The next section of the thesis uses the language of functionals 
F'(q;), depending on the values of the q’s at the sequence of times 
t;, to derive the quantum Lagrangian equations of motion from the 
path integrals. It shows the relation of these equations to g-number 
equations, such as pg — qp = h/t and discusses the relation of the 
Lagrangian formulation to the Hamiltonian one for cases where the 
latter exists. For example, the well-known result is derived that 
HF — FH = (h/i)F. 

As was the case in the discussion of the classical theory, Feynman 
extends the formalism to the case of a more general action functional, 
beginning with the simple example of “a particle in a potential V(x) 
and which also interacts with itself in a mirror, with half advanced 
and half retarded waves.” An immediate difficulty is that the corre- 
sponding Lagrangian function involves two times. As a consequence, 
the action integral over the finite interval between times T; and 75 
is meaningless, because “the action might depend on values of x(t) 
outside of this range.” One can avoid this difficulty by formally let- 
ting the interaction vanish at times after large positive T> and before 
large negative T,. Then for times outside the range of integration 
the particles are effectively free, so that wave functions can be de- 
fined at the endpoints. With this assumption the earlier discussions 
concerning functionals, operators, etc., can be carried through with 
the more general action functional. 

However, the question as to whether a wave function or other 
wave-function-like object exists with the generalized Lagrangian is 
not solved in the thesis (and perhaps has never been solved). Al- 
though Feynman shows that much of quantum mechanics can be 
solved in terms of expectation values and transition amplitudes, at 
the end it is far from clear that it is possible to drop the very useful 
notion of the wave function (and if it is possible, it is probably not 
desirable to do so). A number of the pages of the thesis that follow 
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are concerned with the question of the wave function, with conser- 
vation of energy, and with the calculation of transition probability 
amplitudes, including the development of a perturbation theory. 

We shall not discuss these issues here, but continue to the last 
part of the thesis, where the forced harmonic oscillator is calcu- 
lated. Based upon the path-integral solution of that problem, parti- 
cles interacting through an intermediate oscillator are introduced and 
eventually the oscillators (i.e. the “field variables”) are completely 
eliminated. Enrico Fermi had introduced the method of represent- 
ing the electromagnetic field as a collection of oscillators and had 
eliminated the oscillators of longitudinal and timelike polarization to 
give the instantaneous Coulomb potential, as Feynman points out.!° 
That had been the original aim of the thesis, to eliminate all of the 
oscillators (and hence the field) in order to quantize the Wheeler— 
Feynman action-at-a-distance theory. It turns out, however, that the 
elimination of all the oscillators was also very valuable in field the- 
ory having purely retarded interaction, and led in fact to the overall 
space-time point of view, to path integrals, and eventually to Feyn- 
man diagrams and renormalization. 

We will sketch very briefly how Feynman handled the forced 
oscillator, using the symbol S' for the generalized action. He wrote 


bade} 


where Spo is the action of the other particles of the system of which the 
oscillator [a(t)] is a part, and y(t)zx is the interaction of the oscillator 
with the particles that form the rest of the system. If y(t) is a simple 
function of time (for example cos w;t) then it represents a given force 
applied to the oscillator. However, more generally we are dealing 
with an oscillator interacting with another quantum system and ¥(t) 
is a functional of the coordinates of that system. Since the action 
S — So depends quadratically and linearly on x(t), the path integrals 


15 Feynman mentions in this connection Fermi’s influential article “Quantum theory 
of Radiation,” Rev. Mod. Phys. 4 (1932) pp. 87-132. In this paper, the result is 
assumed to hold; it was proven earlier by Fermi in “Sopra l’elettrodynamica quantistica,” 
Rendiconti della R. Accademia Nazionali dei Lincet 9 (1929) pp. 881-887. 
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over the paths of the oscillator can be performed when calculating 
the transition amplitude of the system from the initial time 0 to the 
final time T. With 2(0) = 2 and 2x(T) = 2’, Feynman calls the 
function so obtained G,(z,a';T), obtaining finally the formula for 
the transition amplitude 


(xr|1|bo) s = i XT (Qm, ver 0l-M-IG, (a, x’; T)bo(Qo, 2’) 


x ded! 4m V9AQ0  ¥9dQ0 : 
NL sete 
where the Q’s are the coordinates of the system other than the 
oscillator. 

By using the last expression in the problem of particles interacting 
through an intermediate oscillator having 7(0) = a and 2(T) = 
3, Feynman shows that the expected value of a functional of the 
coordinates of the particles alone (such as a transition amplitude) can 
be obtained with a certain action that does not involve the oscillator 
coordinates, but only the constants a and £.!° This eliminates the 
oscillator from the dynamics of the problem. Various other initial 
and/or final conditions on the oscillator are shown to lead to a similar 
result. A brief section labeled “Conclusion” completes the thesis. 


Laurie M. Brown 
April 2005 


The editor (LMB) thanks to Professor David Kiang for his invaluable 
assistance in copy-editing the retyped manuscript and checking the 
equations. 


16 In the abstract at the end of the thesis this conclusion concerning the interaction of 
two systems is summarized as follows: “It is shown that in quantum mechanics, just as 
in classical mechanics, under certain circumstances the oscillator can be completely elim- 
inated, its place being taken by a direct, but, in general, not instantaneous, interaction 
between the two systems.” 


THE PRINCIPLE OF LEAST ACTION IN 
QUANTUM MECHANICS 


RICHARD P. FEYNMAN 


Abstract 


A generalization of quantum mechanics is given in which the cen- 
tral mathematical concept is the analogue of the action in classical 
mechanics. It is therefore applicable to mechanical systems whose 
equations of motion cannot be put into Hamiltonian form. It is 
only required that some form of least action principle be available. 

It is shown that if the action is the time integral of a function 
of velocity and position (that is, if a Lagrangian exists), the gener- 
alization reduces to the usual form of quantum mechanics. In the 
classical limit, the quantum equations go over into the correspond- 
ing classical ones, with the same action function. 

As a special problem, because of its application to electrody- 
namics, and because the results serve as a confirmation of the pro- 
posed generalization, the interaction of two systems through the 
agency of an intermediate harmonic oscillator is discussed in de- 
tail. It is shown that in quantum mechanics, just as in classical 
mechanics, under certain circumstances the oscillator can be com- 
pletely eliminated, its place being taken by a direct, but, in general, 
not instantaneous, interaction between the two systems. 

The work is non-relativistic throughout. 


I. Introduction 


Planck’s discovery in 1900 of the quantum properties of light led to 
an enormously deeper understanding of the attributes and behaviour 
of matter, through the advent of the methods of quantum mechanics. 
When, however, these same methods are turned to the problem of 
light and the electromagnetic field great difficulties arise which have 
not been surmounted satisfactorily, so that Planck’s observations still 
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remain without a consistent fundamental interpretation.! 


As is well known, the quantum electrodynamics that have been 
developed suffer from the difficulty that, taken literally, they predict 
infinite values for many experimental quantities which are obviously 
quite finite, such as for example, the shift in energy of spectral lines 
due to interaction of the atom and the field. The classical field the- 
ory of Maxwell and Lorentz serves as the jumping-off point for this 
quantum electrodynamics. The latter theory, however, does not take 
over the ideas of classical theory concerning the internal structure of 
the electron, which ideas are so necessary to the classical theory to 
attain finite values for such quantities as the inertia of an electron. 
The researches of Dirac into the quantum properties of the electron 
have been so successful in interpreting such properties as its spin and 
magnetic moment, and the existence of the positron, that is hard to 
believe that it should be necessary in addition to attribute internal 
structure to it. 

It has become, therefore, increasingly more evident that before 
a satisfactory quantum electrodynamics can be developed it will be 
necessary to develop a classical theory capable of describing charges 
without internal structure. Many of these have now been developed, 
but we will concern ourselves in this thesis with the theory of action 
at a distance worked out in 1941 by J. A. Wheeler and the author.? 

The new viewpoint pictures electrodynamic interaction as direct 
interaction at a distance between particles. The field then becomes 
a mathematical construction to aid in the solution of problems in- 
volving these interactions. The following principles are essential to 
the altered viewpoint: 

(1) The acceleration of a point charge is due to the sum of its in- 
teractions with other charged particles. A charge does not act on 
itself. 


1 Tt is important to develop a satisfactory quantum electrodynamics also for another 


reason. At the present time theoretical physics is confronted with a number of fun- 
damental unsolved problems dealing with the nucleus, the interactions of protons and 
neutrons, etc. In an attempt to tackle these, meson field theories have been set up in 
analogy to the electromagnetic field theory. But the analogy is unfortunately all too 
perfect; the infinite answers are all too prevalent and confusing. 

2 Not published. See, however, Phys. Rev. 59, 683 (1941). 
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(2) The force of interaction which one charge exerts on another is 
calculated by means of the Lorentz force formula, F = e[E+% x H], in 
which the fields are the fields generated by the first charge according 
to Maxwell’s equations. 

(3) The fundamental (microscopic) phenomena in nature are sym- 
metrical with respect to interchange of past and future. This requires 
that the solution of Maxwell’s equation to be used in computing the 
interactions is to be half the retarded plus half the advanced solution 
of Lienard and Wiechert. 

These principles, at first sight at such variance with elementary 
notions of causality, do in fact lead to essential agreement with the 
results of the more usual form of electrodynamics, and at the same 
time permit a consistent description of point charges and lead to a 
unique law of radiative damping. That this is the case has been 
shown in the work already referred to (see note 2). It is shown that 
these principles are equivalent to the equations of motion resulting 
from a principle of least action. The action function (due to Tetrode,? 
and, independently, to Fokker*) involves only the coordinates of the 
particles, no mention of fields being made. The field is therefore a 
derived concept, and cannot be pictured as analogous to the vibra- 
tions of some medium, with its own degrees of freedom (for example, 
the energy density is not necessarily positive.) Perhaps a word or 
two as to what aspects of this theory make it a reasonable basis for 
a quantum theory of light would not be amiss. 

When one attempts to list those phenomena which seem to in- 
dicate that light is quantized, the first type of phenomenon which 
comes to mind are like the photoelectric effect or the Compton ef- 
fect. One is however, struck by the fact that since these phenomena 
deal with the interaction of light and matter their explanation may 
lie in the quantum aspects of matter, rather than requiring photons 
of light. This supposition is aided by the fact that if one solves the 


3H. Tetrode, Zeits. f. Physik 10, 317 (1922). 
4 A.D. Fokker, Zeits. f. Physik 38, 386 (1929); Physica 9, 33 (1929); Physica 12, 145 
(1932). 
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problem of an atom being perturbed by a potential varying sinu- 
soidally with the time, which would be the situation if matter were 
quantum mechanical and light classical, one finds indeed that it will 
in all probability eject an electron whose energy shows an increase 
of hv, where v is the frequency of variation of the potential. In a 
similar way an electron perturbed by the potential of two beams of 
light of different frequencies and different directions will make tran- 
sitions to a state in which its momentum and energy is changed by 
an amount just equal to that given by the formulas for the Compton 
effect, with one beam corresponding in direction and wavelength to 
the incoming photon and the other to the outgoing one. In fact, one 
may correctly calculate in this way the probabilities of absorption 
and induced emission of light by an atom. 

When, however, we come to spontaneous emission and the mech- 
anism of the production of light, we come much nearer to the real 
reason for the apparent necessity of photons. The fact that an atom 
emits spontaneously at all is impossible to explain by the simple 
picture given above. In empty space an atom emits light and yet 
there is no potential to perturb the systems and so force it to make a 
transition. The explanation of modern quantum mechanical electro- 
dynamics is that the atom is perturbed by the zero-point fluctuations 
of the quantized radiation field. 

It is here that the theory of action at a distance gives us a different 
viewpoint. It says that an atom alone in empty space would, in fact, 
not radiate. Radiation is a consequence of the interaction with other 
atoms (namely, those in the matter which absorbs the radiation). 
We are then led to the possibility that the spontaneous radiation 
of an atom in quantum mechanics also, may not be spontaneous 
at all, but induced by the interaction with other atoms, and that 
all of the apparent quantum properties of light and the existence of 
photons may be nothing more than the result of matter interacting 
with matter directly, and according to quantum mechanical laws. 

An attempt to investigate this possibility and to find a quantum 
analogue of the theory of action at a distance, meets first the difficulty 
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that it may not be correct to represent the field as a set of harmonic 
oscillators, each with its own degree of freedom, since the field in 
actuality is entirely determined by the particles. On the other hand, 
an attempt to deal quantum mechanically directly with the parti- 
cles, which would seem to be the most satisfactory way to proceed, 
is faced with the circumstance that the equations of motion of the 
particles are expressed classically as a consequence of a principle of 
least action, and cannot, it appears, be expressed in Hamiltonian 
form. 

For this reason a method of formulating a quantum analog of sys- 
tems for which no Hamiltonian, but rather a principle of least action, 
exists has been worked out. It is a description of this method which 
constitutes this thesis. Although the method was worked out with 
the express purpose of applying it to the theory of action at a dis- 
tance, it is in fact independent of that theory, and is complete in 
itself. Nevertheless most of the illustrative examples will be taken 
from problems which arise in the action at a distance electrodynam- 
ics. In particular, the problem of the equivalence in quantum me- 
chanics of direct interaction and interaction through the agency of 
an intermediate harmonic oscillator will be discussed in detail. The 
solution of this problem is essential if one is going to be able to com- 
pare a theory which considers field oscillators as real mechanical and 
quantized systems, with a theory which considers the field as just a 
mathematical construction of classical electrodynamics required to 
simplify the discussion of the interactions between particles. On the 
other hand, no excuse need be given for including this problem, as its 
solution gives a very direct confirmation, which would otherwise be 
lacking, of the general utility and correctness of the proposed method 
of formulating the quantum analogue of systems with a least action 
principle. 

The results of the application of these methods to quantum elec- 
trodynamics is not included in this thesis, but will be reserved for a 
future time when they shall have been more completely worked out. 
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It has been the purpose of this introduction to indicate the motiva- 
tion for the problems which are discussed herein. It is to be empha- 
sized again that the work described here is complete in itself without 
regard to its application to electrodynamics, and it is this circum- 
stance which makes it appear advisable to publish these results as an 
independent paper. One should therefore take the viewpoint that the 
present paper is concerned with the problem of finding a quantum 
mechanical description applicable to systems which in their classi- 
cal analogue are expressible by a principle of least action, and not 
necessarily by Hamiltonian equations of motion. 

The thesis is divided into two main parts. The first deals with the 
properties of classical systems satisfying a principle of least action, 
while the second part contains the method of quantum mechanical 
description applicable to these systems. In the first part are also 
included some mathematical remarks about functionals. All of the 
analysis will apply to non-relativistic systems. The generalization to 
the relativistic case is not at present known. 


II. Least Action in Classical Mechanics 


1. The Concept of a Functional 


The mathematical concept of a functional will play a rather predom- 
inant role in what is to follow so that it seems advisable to begin 
at once by describing a few of the properties of functionals and the 
notation used in this paper in connection with them. No attempt is 
made at mathematical rigor. 

To say F is a functional of the function q(a7) means that F is a 
number whose value depends on the form of the function g(a) (where 
o is just a parameter used to specify the form of q(c)). Thus, 


r= i q(o)2e~® do (1) 


is a functional of g(o) since it associates with every choice of the 
function q(o) a number, namely the integral. Also, the area under 
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a curve is a functional of the function representing the curve, since 
to each such function a number, the area is associated. The expected 
value of the energy in quantum mechanics is a functional of the wave 
function. Again, 


F = q(0) (2) 


is a functional, which is especially simple because its value depends 
only on the value of the function q(o) at the one point a = 0. 

We shall write, if F is a functional of g(o), F[q(c)]. A functional 
may have its argument more than one function, or functions of more 
than one parameter, as 


F{x(t,s), y(t, s)] = [- [- x(t, s)y(t, s) sinw(t — s)dtds. 


A functional F[q(o)]| may be looked upon as a function of an 
infinite number of variables, the variables being the value of the 
function g(a) at each point co. If the interval of the range of o is 
divided up into a large number of points o;, and the value of the 
function at these points is q(o;) = q, say, then approximately our 
functional may be written as a function of the variables q;. Thus, in 
the case of equation (1) we could write, approximately, 


[oe) 
—o2 


1=— 00 


We may define a process analogous to differentiation for our func- 
tionals. Suppose the function q(c) is altered slightly to g(a) + A(c) 
by the addition of a small function A(o). From our approximate 
viewpoint we can say that each of the variables is changed from q; 
to q; + A;. The function is thereby changed by an amount 


S- Cegern) es 
F Ogi 
In the case of a continuous number of variables, the sum becomes 
an integral and we may write, to the first order in 4, 


Fla(o) + A(o)] — Fla(o)] = [K@xwae, (3) 
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where A(t) depends on F’, and is what we shall call the functional 
derivative of F' with respect to q at t, and shall symbolize, with 
Eddington,” by — It is not simply OF a) as this is in 
general infinitesimal, but is rather the sum of these ce over a short 
range of 7, say from i+ k to i —k, divided by the interval of the 
parameter, Oj4% — Oj—k- 

Thus we write, 


OF lq 
Fla(o) + Xo)] = a+ f ten that 
+ higher order ne in X. (4) 


For example, in equation (1) if we substitute q+ for q, we obtain 
Fla+ N= i TOE ONC EO Outaas 
= [ aoye*ao +2 / q(a)Mo)ew” do 


+ higher terms in \. 


Therefore, in this case, we have aid — =2q(t)e- ® Ina similar way, if 
F{a(o)| = ¢(0), os om = 6(t), where 6(t) is Dirac’s delta symbol, 
defined by f 5(t) f(t)dt = f(0) for any continuous function f. 


The eee a : for which ea is zero for all t is that function 
for which F' is an extremum. For example, in classical mechanics the 


of = i L(q(0),q(0))do (5) 


is a functional of g(a). Its functional derivative is, 


6A _ d f OL(q(t),a(t)) | _, AL(4(t), a(t) 
oq) af aq hy dq , 6) 


action, 


If & is an extremum the right hand side is zero. 


5 A. 8S. Eddington, “The Mathematical Theory of Relativity” (1923) p. 139. 
Editor’s note: We have changed Eddington’s symbol for the functional derivative to that 
now commonly in use. 
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2. The Principle of Least Action 


For most mechanical systems it is possible to find a functional, &, 
called the action, which assigns a number to each possible mechanical 
path, qi(o),q2(0)...qn(o), (we suppose N degrees of freedom, each 
with a coordinate qg,(o), a function of a parameter (time) o) in such 
a manner that this number is an extremum for an actual path g(c) 
which could arise in accordance with the laws of motion. Since this 
extremum often is a minimum this is called the principle of least 
action. It is often convenient to use the principle itself, rather than 
the Newtonian equations of motion as the fundamental mechanical 
law. The form of the functional </[q:(c)...qn(o)| depends on the 
mechanical problem in question. 

According to the principle of least action, then, if 
@\qi(o)...qn(o)] is the action functional, the equations of motion 
are N in number and are given by, 

OA 6A 6A 


out) ie) Say) ) 


(We shall often simply write Ol = 0, as if there were only one 


variable). That is to say if all the derivatives of <&, with respect to 
Qn(t), computed for the functions g,(o) are zero for all t and all n, 
then Gm(co) describes a possible mechanical motion for the systems. 

We have given an example, in equation (5), for the usual one 
dimensional problem when the action is the time integral of a La- 
grangian (a function of position and velocity, only). As another ex- 
ample consider an action function arising in connection with the 
theory of action at a distance: 


oo mar 2 
ob = is aS — V(a(t)) + k?a2(t)e(t + ay} dt. (8) 


It is approximately the action for a particle in a potential V(a), and 
interacting with itself in a distant mirror by means of retarded and 
advanced waves. The time it takes for light to reach the mirror from 
the particle is assumed constant, and equal to 7/2. The quantity 
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k? depends on the charge on the particle and its distance from the 
mirror. If we vary x(t) by a small amount, A(t), the consequent 


variation in & is, 
bef =f {ma(t)A(®) — Vel) AW + PAD HET) 
+ k?\(t + To) a(t)} dt 
= [mate - V'ep) - Bae + 1) 


by integrating 


a(t Te)}d(ae, YO 


so that, according to our definition (4), we may write, 


Eno —mi(t) — V'(x(t)) — k°#(t+ To) — k°#(t-— Tp). — (9) 
ba(t) 
The equation of motion of this system is obtained, according to (7) 
by setting ao} equal to zero. It will be seen that the force acting at 
time t depends on the motion of the particle at other time than ft. 
The equations of motion cannot be described directly in Hamiltonian 
form. 


3. Conservation of Energy. Constants of the Motion® 


The problem we shall study in this section is that of determining to 
what extent the concepts of conservation of energy, momentum, etc., 
may be carried over to mechanical problems with a general form 
of action function. The usual principle of conservation of energy 
asserts that there is a function of positions at the time t, say, and 
of velocities of the particles whose value, for the actual motion of 
the particles, does not change with time. In our more general case 
however, the forces do not involve the positions of the particles only 
at one particular time, but usually a calculation of the forces requires 


6 This section is not essential to an understanding of the remainder of the paper. 
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a knowledge of the paths of the particles over some considerable 
range of time (see for example, Eq. (9)). It is not possible in this 
case generally to find a constant of the motion which only involves 
the positions and velocities at one time. 

For example, in the theory of action at a distance, the kinetic 
energy of the particles is not conserved. To find a conserved quantity 
one must add a term corresponding to the “energy in the field”. The 
field, however, is a functional of the motion of the particles, so that 
it is possible to express this “field energy” in terms of the motion of 
the particles. For our simple example (8), account of the equations 
of motion (9), the quantity, 

m(a(t))? 


t+To 
B() = + vate) — ef eae ds 


+ k?a(t)¢(t + To) , (10) 


has, indeed, a zero derivative with respect to time. The first two 
terms represent the ordinary energy of the particles. The additional 
terms, representing the energy of interaction with the mirror (or 
rather, with itself) require a knowledge of the motion of the particle 
from the time t — Tp tot + Jo. 

Can we really talk about conservation, when the quantity con- 
served depends on the path of the particles over considerable ranges 
of time? If the force acting on a particle be F(t) say, so that the 
particle satisfies the equation of motion m#(t) = F(t), then it is 
perfectly clear that the integral, 


1) =f [mié(t) — F(t)]#(t)at (11) 


has zero derivative with respect to t, when the path of the particle 
satisfies the equation of motion. Many such quantities having the 
same properties could easily be devised. We should not be inclined 
to say (11) actually represents a quantity of interest, in spite of its 
constancy. 
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The conservation of a physical quantity is of considerable interest 
because in solving problems it permits us to forget a great number 
of details. The conservation of energy can be derived from the laws 
of motion, but its value lies in the fact that by the use of it certain 
broad aspects of a problem may be discussed, without going into 
the great detail that is often required by a direct use of the laws of 
motion. 

To compute the quantity I(t), of equation (11), for two different 
times, t; and tg that are far apart, in order to compare J(t,) with 
I(t2), it is necessary to have detailed information of the path during 
the entire interval t, to tg. The value of J is equally sensitive to the 
character of the path for all times between ¢; and tg, even if these 
times lie very far apart. It is for this reason that the quantity I(t) is 
of little interest. If, however, F’ were to depend on x(t) only, so that 
it might be derived from a potential, (e.g.; F = —V'(a)), then the 
integrand is a perfect differential, and may be integrated to become 
zm(&(t))? + V(x(t)). A comparison of J for two times, ty and tg, 
now depends only on the motion in the neighborhood of these times, 
all of the intermediate details being, so to speak, integrated out. 

We therefore require two things if a quantity I(t) is to attract 
our attention as being dynamically important. The first is that it be 
conserved, I(t;) = I(t2). The second is that I(t) should depend only 
locally on the path. That is to say, if one changes the path at some 
time t’ in a certain (arbitrary) way, the change which is made in I(t) 
should decrease to zero as t’ gets further and further from t. That 
is to say, we should like the condition — — O0as |t—t'| — co 
satisfied.” 


7 A more complete mathematical analysis than we include here is required to state 


rigorously just how fast it must approach zero as |t — t’| approaches infinity. The proofs 
states herein are certainly valid if the quantities in (12) and (20) are assumed to become 
and remain equal to zero for values of |t — t’| greater than some finite one, no matter 
how large it may be. 
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The energy expression (10) satisfies this criterion, as we have al- 
ready pointed out. Under what circumstances can we derive an anal- 
ogous constant of the motion for a general action function? 

We shall, in the first place, impose a condition on the equations of 
motion which seems to be necessary in order that an integral of the 
motion of the required type exist. In the equation OI = 0, which 
holds for an arbitrary time, t, we shall suppose that the influence of 
changing the path at time t’ becomes less and less as |t—t’| approaches 
infinity. That is to say, we require, 


eed 
dq(t)dq(t’) 


We next suppose that there exists a transformation (or rather, a 


0 as |t-T’| oo. (12) 


continuous group of transformation) of coordinates, which we sym- 
bolize by dn — Gn + n(a) and which leaves the action invariant 
(for example, the transformation may be a rotation). The trans- 
formation is to contain a parameter, a, and is to be a continuous 
function of a. For a equal to zero, the transformation should reduce 
to the identity, so that x,,(0) = 0. For very small a we may expand; 
In(a) = 0+ ay, +.... That is to say, for infinitesimal a, if the 
coordinates gy, are changed to qn + ayy the action is left unchanged; 


A\Gn(o)] = A\In(o) + ayn(o)] ; (13) 


For example, if the form of the action is unchanged if the particles 
take the same path at a later time, we may take, ¢,(t) > dn(t+a). In 
this case, for small a, dn(t) > dn(t) + agn(t) +... so that Yn = Gr(t). 

For each such continuous set of transformations there will be a 
constant of the motion. If the action is invariant with respect to 
change from q(t) to q(t + a), then an energy will exist. If the ac- 
tion is invariant with respect to the translation of all the coordinates 
(rectangular coordinates, that is) by the same distance, a, then a 
momentum in the direction of the translation may be derived. For 
rotations around an axis through the angle, the corresponding con- 
stant of the motion is the angular momentum around that axis. We 
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may show this connection between the groups of transformations and 
the constants of the motion, in the following way: For small a, from 
(13), we shall have, 


A\Gn(o)| = A\In(o) RG ayn(o)| : 


Expanding the left side with respect to the change in the coordinate 
aYn(o), according to (4) to the first order in a we have, 


Nf 6a 
aslo) + ayn(o)] = lato) + 0” i: lsat (14) 


and therefore, on account of (13), 


N ore) 

x Fac ela =0),; (15) 
54n(o) 

nai te 


Now consider the quantity, 


N/T 
1) => 4 inl) go da (16) 


On account of (15) we can also write, 


N fore) 
(cD | no) do. (17) 


7 
Consider the derivative of J with respect to T; an = 
eda 3a According to the equations of motion (7), this is 
seen to vanish, so that I(T) is independent of T for the real motion, 
and is therefore conserved. We must now prove, in order that it be 
acceptable as an important constant of the motion, that 


51(T) 
5dm(t) 


—0 as |[—t]|—oco for any m. (18) 
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Suppose first that t > 7. Let us compute a. directly from equa- 


tion (16), obtaining, 
ol(T) a OYn(o 
=+ do 
bdm(t) d, ddm( z ee a) 


T OW 
+f Coley Oo 


Now we shall suppose that y,(a7) does not depend very much on 


values of g(t) for times, t, far away from o. That is to say we shall 


assume,® 


dYn(o) 
OGm(t) 


In the first integral then, since t > 7’, and since only values of a less 


—0 as jo—t|—oo. (20) 


than J appear in the integrand, for all such values, t—o >t—T. As 
t — T approaches infinity, therefore, only terms in the first integral 
of (19) for which t — o approaches infinity appear. We shall suppose 
that aa decreases sufficiently rapidly with increase in t — o that 
the integral of it goes to zero as t — T’ becomes infinite. A similar 
analysis applies to the second integral of (19). Here the quantity 
Pe ka hes zero because of our assumption (12), and we 
Tgnlijoqatay @PProaches zero p 

shall suppose this approach sufficiently rapid that the integral vanish 
in the limit. 


Thus we have shown that —-. + O0ast—T— oo. To prove 


the corresponding relation for T — t — oo one may calculate 


with ¢ < T from (17), and proceed in exactly the same manner. In 
this way we can establish the required relation (18). This then shows 
that J(T) is an important quantity which is conserved. 

A particularly important example is, of course, the energy expres- 
sion. This is got by the transformation of displacing the time, as has 


8 In fact, for all practical cases which come to mind (energy momentum, angular mo- 


dyn(o) ; . 
> Sam(e) 1 20 


mentum, corresponding to time displacement, translation, and rotation) 
tually zero if o At. 
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already been mentioned, for which yn(o) = dn(a). The energy inte- 
gral may therefore be expressed, according to (16) (we have changed 
the sign), as, 


Jaane 5A 
B= - f Linger (21) 


In our example (8) we would get from this formula, 


T 
F(T) =~— / es er OEE Conner re 


— k?#(o — Tp)|do , (22) 


from which (10) has been derived by direct integration. 


4. Particles Interacting through an Intermediate Oscillator 


The problem we are going to discuss in this section, since it will give 
us a good example of a system for which only a principle of least 
action exists, is the following: Let us suppose we have two particles 
A and B which do not interact directly with each other, but there 
is a harmonic oscillator, O with which both of the particles A and 
B interact. The harmonic oscillator, therefore serves as an interme- 
diary by means of which particle A is influenced by the motion of 
particle B and vice versa. In what way is this interaction through 
the intermediate oscillator equivalent to a direct interaction between 
the particles A and B, and can the motion of these particles, A, B, 
be expressed by means of a principle of least action, not involving 
the oscillator? (In the theory of electrodynamics this is the problem 
as to whether the interaction of particles through the intermediary 
of the field oscillators can also be expressed as a direct interaction at 
a distance.) 

To make the problem precise, we let y(t) and z(t) represent co- 
ordinates of the particles A and B at the time ¢t. Let the La 
grangians of the particles alone be designated by L, and Lz. Let them 
each interact with the oscillator (with coordinate x(t), Lagrangian 
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(¢* — w*x?)) by means of a term in the Lagrangian for the entire 
system, which is of the form (J, + I,)a, where I, is a function in- 
volving the coordinates of atom A only, and J, is some function of 
the coordinates of B. (We have assumed the interaction linear in the 
coordinate of the oscillator.) 

We then ask: If the action integral for y, z, x, is 


“2 2.2 
[ |but tet (A - ) + y+ te] dt, (23) 


is it possible to find an action &, a functional of y(t), z(t), only, 


such that, as far as the motion of the particles A, B, are concerned, 
(i.e., for variations of y(t), z(t)) the action & is a minimum? 

In the first place, since the actual motion of the particles A, B, 
depends not only on y, z, initially (or at any other time) but also 
on the initial conditions satisfied by the oscillator, it is clear that @# 
is not determined absolutely, but the form that ./ takes must have 
some dependence on the state of the oscillator. 

In the second place, since we are interested in an action princi- 
ple for the particles, we must consider variations of the motion of 
these particles from the true motion. That is, we must consider dy- 
namically impossible paths for these particles. We thus meet a new 
problem; when varying the motion of the particle A and B, what 
do we do about the oscillator? We cannot keep the entire motion of 
the oscillator fixed, for that would require having this entire motion 
directly expressed in the action integral and we should be back where 
we started, with the action (23). 

The answer to this question lies in the observation made above 
that the action must involve somehow some of the properties of the 
oscillator. In fact, since the oscillator has one degree of freedom it will 
require two numbers (e.g. position and velocity) to specify the state 
of the oscillator sufficiently accurately that the motion of the particles 
A and B is uniquely determined. Therefore in the action function 
for these particles, two parameters enter, which are arbitrary, and 
represent some properties of the motion of the oscillator. When the 
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variation in the motion of the particles is taken, these quantities 
must be considered as constants, and thus, it is the properties of the 
oscillator described by these quantities which are considered fixed for 
the “impossible” motion of the particles. We should not be surprised 
to find that the action function for the particles depends on which 
properties of the oscillator are considered to be held fixed in the 
variation of y, and z. It is, on the other hand, somewhat unexpected 
that, as we shall see, not all possible conditions on the oscillator give 
rise to motions of y and z which are simply expressible in terms of 
an action principle. Let us see how this works out in detail. 

We shall assume that the functions I, and I, are zero for times t¢ 
greater than T and less than 0. We shall also assume (for simplicity 
only) that I, is a function of t and y(t) only, and does not depend 
on y(t). Similarly, I, is to be independent of 2(t). Then, from (23), 
the equation of motion of particle y is, 


dO g\. Oly 01, 
a (at) - aE MO oe 


with a similar equation for z. That for the oscillator is, 
mi + mwa = [I,(t) + L(t). (25) 
The solution of this last equation is, where we have written 7 = Iy+TI, 


x(t) = x(0) coswt + = + ah y(s)sinw(t — s)ds. (26) 


mw 


This may be expressed in other ways, for example, 


a(t) = melt (0 - — sin ws7(s)ds 
Ey | cs mr [ smecr- s)r(s)ds| » (27) 
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or again, 


1 ' : 
ACA nor [Rrsinwt + Rosinw(T — t)] 


SMe / sin w(t _— s)y(s)ds ; (28) 


where we have written 


i= ; (0 + 2(T)coswT — «(T) vet (29) 
and 
me ; a(n) AOveneT = oe . (30) 


It is seen that Rr is the mean of the coordinate of the oscillator at 
time T and what that coordinate would have been at this time if the 
oscillator had been free and started with its actual initial conditions. 
Similarly, Ro is the mean of the initial coordinate and what that 
coordinate would have had to be, were the oscillator free, to produce 
the actual final conditions at time 7. Outside the time range 0 to T 
the oscillator is, of course, simply a free oscillator. 

These expressions for x(t) may be substituted into (24) and the 
corresponding equation for z, to obtain various expressions for the 
motion of the particles y, z; each expressed in terms of these particles 
and two parameters connected with the oscillator. For the expression 
(26) these parameters are x(0) and «(0); for (27) they are x(0) and 
x(T); for (28) they are Ro and Rr. We should like to determine 
whether these expressions could be obtained from an action for y 
and z only. 
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If the action be &, then the expression (24) must be of the form 
6D 


0 ae 0. That is to say, we must have, 


OL 
_ i - a(t). (31) 


t 


6A _@ OL OL 
dy(t) dt 


7 Oy)” Oy 


t 


We seek the solution of this expression for each expression we may 
write for x(t). 

Now an equation such as (31) (which is really an infinite set 
of equations, one for each value of t) does not always have a so- 


lution. One of the necessary requirements is, since TO} (24) — 


ni (2%), that, 

§_[_d (aby) , Oy] , Oy 
dy(s) | dt \ Oy Oy |, Oy 
__8 [a (aby) | aby 
— dy(t) | ds \ dy Oy 


This requires, therefore, if s ~ t, that x(t) satisfy, 


+ -2(0) 


; o(9)). 


t 
Oly 


s 


OLy| da(t) _ Oly| dx(s) (32) 
Oy |,dy(s) Oy |, dy(t) 
For the expression (26), we have, 
ony) Be SUS Bio ie ane ae a ae 
dy(s) mw Oy |, (33) 


=0 if s>t. 


With this expression equation (32) is not satisfied so that we may 
conclude that no simple action function will describe the motion 
of the particles A and B if the initial position and velocity of the 
oscillator are considered as fixed. 
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On the other hand, since for the expression (27) we get, 


da(t) __ sinw(T'— t) sinws Oly ip get 
dy(s) mw sin wT Oy |, 
=, mee) em ene ly Pasa: (34) 
mw sin wT Oz |, 


we may conclude, since (34) satisfies (32), that an action does exist 
if the oscillator has a given initial and a given final position. In fact, 
we may solve equation (31) with x(t) expressed as in (27) and obtain, 
as an expression for the action, 


we fe Ane: jes ee ant) (oat 


sinwT’ 


1 iy t 
| at f ds -sinw(T — t)sinwsy(s)y(t). (35) 


muwsinwT Jo 


The motion of the particles is given by this action principle by 
requiring that it be a minimum for variations of y(t) and z(t), where 
the quantities (0) and x(T) are considered as fixed constants (for 
example, they might be zero). 

In the case that x(t) is given by the expression (28) with Ro and 
Rr as constants, we find, 


Ria) Hie sin w(t — s) ; if s< 
(36) 
1 Ol, 
Fy nwlt s) By |. iL 8 Shy 


so that the relation (32) is satisfied for this case also. The action 
function for this case is, 


ia 
CC fe y+ L,\dt+ / [Rr sin wt + Ro sinw(T — t)]y(t)dt 


sinwT 


4 af [ sin w(t — s)y(t)y(s)dsdt . (37) 
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This action is of particular interest inasmuch as in the special case 
Rr = Ro = 0 the integrand no longer depends on T,, and the limits 
of integration of 0 to T may be replaced without error by —oo and 
oo so that we obtain an action of the special form, 


ye / 7 [by + Leldt + — is . i Saeaaaten@asde: 
(38) 


If, now, after having passed to the limit T = oo, we assume that y(t) 
and Ly, L, do not depend on t explicitly, the substitution y(t) —- 
y(t + a) and z(t) > z(t+ a) does not alter the action function, so 
that an energy expression exists for this action. (In electrodynamics 
it leads to the half advanced plus half retard interaction used in the 
action at a distance theory.) 

Exactly the same formulas result if it is that y(t) depends on the 
function y(t), as any general functional. The action of the particles 
need not be the integral of a Lagrangian in the original form (23) 
either. If there are more than one intermediate oscillator, and if the 
oscillators are independent (i.e. if no two of the oscillators interact 
directly) the expressions for the action, with the oscillators elimi- 
nated are similar to (35), (37) and (38), except that there is a sum of 
interaction terms, one for each oscillator. Thus, if the frequency of 
the jth oscillator is w;, its mass m;, the interaction with the particles 
given by 7; and there were N oscillators, (38), for example, would 
read, 


lee) N lee) t 
1 3 
A = fen let + Dinas [. ie sin w;(t — s) 


x 7; (t)7;(s)dsdt . (39) 


By compounding terms of the form (39) many different types of inter- 
action may be obtained. For example, the interaction of equation (8) 
would result from (39) if we had only one particle x, (instead of y and 
z), and an infinite number of oscillators of unit mass (4w sin wT'dw 
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oscillators with frequency w in the range w to w + dw), each inter- 
acting with the particle through the function 7;(t) = #(t) (the same 
for each oscillator). 

If we look a little more closely at the type of interaction of (38) we 
see that since y(t) = I,(t) + I,(t) the interaction contains terms of 
the form I,(t)I,(s) and I,(t)I,(s) as well as I,(t)I,(s) and I,(t)I,(s). 
Although the latter represent interactions between the particles A 
and B, the former type of term represents an interaction of particle 
A with itself, and particle B with itself, so to speak. What type 
of intermediate oscillator system will lead to interactions between 
particles, and to no interaction of a particle with itself? 

This question is easily answered, when it is observed that if, in 
expression (39) for example, we take two oscillators, 7 = 1, and 
j = 2, so that w) = w2 = w; my = —mM2 =m; nN = 1, 4+ I,; and 
99 = Iy—Iz, since, 11(s)u(t)—79(s)7a(t) = 2(Iy(s)Ie(t)+L:(s) y(t), 
the interaction (39) could be written as, 


A= fie [Ey + Ljdt+—— [ if sin w(t — s) 
x [Iy(s)Lz(t) + I.(s)Iy(t)|dsdt (40) 


representing interactions of the particles with each other and having 
no “self-action” terms. Exactly the same procedure leads to the same 
results in the cases (35) and (37). 

The original action, involving the oscillators, which leads to a 
form of this kind is, from what we have said, 


ioe) 
/ ty +L, + (i, Ty)x1 (cL, Ty)x2 


—oo 


m.. m,. 
+ — (4 Sgt) = 5 (#2 —wW *13) dt. 


This may be written, by putting n, = 71 + 2 and my = 21 — £2, 


CO m . . 
v [zy + Lz + Lenz + Lyny + a (Hye = w?nyne)| dt . 
—oo 
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This may be immediately generalized to the case where there are a 
number of particles y;. The action, 


Wrz. 24 
Sen + Ty, Myx) + Ss" S> > (thy.thy — MyM) | at 
2 


TO | & k xk 


will lead to interactions only between pairs of particles k, 1, no terms 
arising corresponding to the action of a particle on itself. 

These action expressions will be of importance in the next part 
of the paper when we discuss them quantum mechanically. Start- 
ing with a system with a Hamiltonian, we have, at least classically, 
found a corresponding non-Hamiltonian action principle, by leaving 
out one member of the system. We have, in this, a way of checking a 
description which is intended as a generalization of quantum mechan- 
ics. We may start with a Hamiltonian system, where the quantum 
mechanics is well known, and show that by suitable elimination of 
the intermediate oscillator we get a system whose classical analogue 
obeys an action principle of the type (35) or (37). We do this on 
page 62. 


III. Least Action in Quantum Mechanics 


Classical mechanics is the limiting form of quantum mechanics when 
Planck’s constant, A, is considered as being vanishingly small. The 
classical system analogous to a quantum mechanical system (when 
such an analogue exists) may be mathematically exhibited most 
directly by letting A approach zero in the quantum mechanical 
equations. 

The inverse problem, that of determining a quantum mechani- 
cal description of a system whose classical mechanical behaviour is 
known, may not be so easily solved. Indeed, the solution cannot be 
expected to be unique; witness, for example, the Klein-Gordon equa- 
tion and Dirac’s equation for the relativistic behaviour of an electron, 
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both of which have the same classical analogue and quite different 
quantum mechanical consequences. 

There exist, however, very useful rules applicable when the clas- 
sical equations may be put into Hamiltonian form, and conjugate 
coordinates and momenta may be defined. These rules, leading to 
Schroddinger’s equations, or Heisenberg’s matrix formulation are too 
well known to require description here.® 

For classical systems, such as those described in the first part 
of this paper, which in general have no Hamiltonian form, and for 
which conjugate momenta and coordinates cannot be defined, no 
satisfactory method of quantization has been given. In fact, the usual 
formulations of quantum mechanics use the concepts of Hamiltonian, 
and of momentum in such a direct and fundamental way that it would 
seem almost impossible to do without them. 

A formulation of quantum mechanics will be described here which 
does not require the ideas of a Hamiltonian or momentum operator 
for its expression. It has, as its central mathematical idea, the ana- 
logue of the action integral of classical mechanics. It is a solution to 
the problem of the quantization of the classical theory of least action 
described in the first part of the paper. 

A generalization of quantum mechanics which is to apply to a 
wider range of classical systems must satisfy at least two conditions. 
First, in the limit as fh approaches zero the quantum mechanical 
equations must pass over into classical equations of motion applica- 
ble to systems in this widened range. And, second, they must be 
equivalent to the present formulations of quantum mechanics appli- 
cable to classical systems with Hamiltonia. The form of quantum 
mechanics to be described below does indeed satisfy both these cri- 
teria for systems for which a principle of least action exists. As an 
additional argument we shall show that the action principles arising 


9 They are discussed very satisfactorily by Dirac in his book, “The Principles of Quan- 


tum Mechanics” (Oxford 1935) on page 88, and on page 119. 
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in classical theory from the elimination of an intermediate harmonic 
oscillator arise in an analogous way in quantum mechanics. 


1. The Lagrangian in Quantum Mechanics 


A description of the proposed formulation of quantum mechanics 


might best begin by recalling some remarks made by Dirac!® 


con- 
cerning the analogue of the Lagrangian and the action in quantum 
mechanics. These remarks bear so directly on what is to follow and 
are so necessary for an understanding of it, that it is thought best to 
quote them in full, even though it results in a rather long quotation. 
Speaking of the transformation function (q|¢) connecting the rep- 
resentations referring to two different times ¢ and T,, he says, “From 
the analogy between classical and quantum contact transformation 

. we see that (q/|g/,) corresponds in the classical theory to eiS/h 
where S' is Hamilton’s principal function for the time interval T to 


t, equal to the time-integral of the Lagrangian L, 


s= f bat. (21) 


Taking an infinitesimal time interval t to t+ dt, we see that (q/, 5,|q)) 
corresponds to e“h. This result gives probably the most funda- 
mental quantum analogue for the classical Lagrangian function. It is 
preferable for the sake of analogy to consider the classical Lagrangian 
as a function of the coordinates at time ¢t and the coordinates at time 
t+t, instead of a function of the coordinates and velocities at time t. 

There is an important action principle in classical mechanics con- 
cerning Hamilton’s principal function (21). It says that this function 
remains stationary for small variations of the trajectory of the sys- 
tem which do not alter the end points, i.e. for small variations of the 
q’s at all intermediate times between T' and t with qr and q fixed. 
Let us see what it corresponds to in the quantum theory. 


10 Dirac, “The Principles of Quantum Mechanics”, p. 124-126. 
Editor’s note: Very similar material is contained in the Third Edition (1947), p. 125-130. 
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Put 
i ft a 
exp {z if Lat} = exp {r5tt, ta) = Bites ta) (22) 
ta 


so that B(ty,ta) corresponds to (q},|qj,,) in the quantum theory. Now 
suppose the time interval T’ — t to be divided up into a large number 


of small time intervals T — t,t, — to,...,tm—1 — tm, tm — t, by 
the introduction of a sequence of intermediate times t1,to,...,tm. 
Then 
BOT) = BG) B Gast. Dah Bi) (28) 
The corresponding quantum equation, which follows from the com- 
position law ..., is 
(a ldr) = ff [dlatn) dy tla) 
x (alg )dai (air) (24) 


q, being written for qj, for brevity. At first sight there does not 
seem to be any close correspondence between (23) and (24). We 
must, however, analyse the meaning of (23) rather more carefully. 
We must regard each factor B as a function of the q’s at the two 
ends of the time interval to which it refers. This makes the right- 
hand side of (23) a function, not only of q and qr, but also of all 
the intermediate q’s. Equation (23) is valid only when we substi- 
tute for the intermediate q’s in its right-hand side their values for 
the real trajectory, small variations in which leave S(t,T) station- 
ary and therefore also, from (22), leave B(t,T) stationary. It is the 
process of substituting these values for the intermediate q’s which 
corresponds to the integrations over all values for the intermediate 
q’s in (24). The quantum analogue of Hamilton’s action principle 
is thus absorbed in the composition law (2/) and the classical re- 
quirement that the values of the intermediate q’s shall make S(t, T) 
stationary corresponds to the condition in quantum mechanics that 
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all values of the intermediate q’s are important in proportion to their 
contribution to the integral in (24). 

Let us see how (23) can be a limiting case of (24) for h small. We 
must suppose the integrand in (24) to be of the form e*”/", where F is 
a function of gp, q,,95--- Gn, dq Which remains continuous as h tends 
to zero, so that the integrand is a rapidly oscillating function when 
his small. The integral of such a rapidly oscillating function will be 
extremely small, except for the contribution arising from a region in 
the domain of integration where comparatively large variations in the 
qj, produce only very small variations in F’. Such a region must be the 
neighborhood of a point where F' is stationary for small variations 
of the q,. Thus the integral in (24) is determined essentially by the 
value of the integrand at a point where the integrand is stationary 
for small variations of the intermediate q’’s, and so (24) goes over 
into (23).” 

We may now point out that not only does (q/, 5,|q;) correspond to 
exp $ [x (See. dss) 6t], where L(q,q) is the Lagrangian for the 
classical system considered as a function of velocity and coordinate, 
but that often it is actually equal to it, within a normalization con- 
stant, in the limit as 6t approaches zero. That is, to terms of order 
ot, if /9(q) dq is the volume element in q-space 


W(Q+64t + dt) = flckssula old? 9(ah) dq; 


idt a q ! d : 
= few id ber ( ine thse) ab(ql,, t) vie 


since 4¢ and q; are just different variables it might be advantageous 


to avoid the subscripts, and to write;! (Q = di 5.9 = q) 


fe ist 1 (Goa Duggan = (Q,t + ot) (41) 


t es 
11 One could just as well write L (Bene g i) for 1 ( Seegentt 


being of a higher order of smallness than we are concerned with. 


sis) the difference 
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as 6t — 0 to the first order in 6t. This is an integral equation to 
determine the wave function v(Q,t+t) for the system at the time t 
in terms of its value at time t. It therefore serves the same function as 
Schrodinger’s equation, and indeed is equivalent to that equation if 
the normalization constant, A(dt), a function of dt is chosen suitably. 

To see how this comes about, we take the simplest case of a par- 
ticle of mass m moving in one dimension in a force field of potential 
V(a). Thus the classical Lagrangian function is L = 3mé? — V(a). 
In accordance with equation (41), then the wave function for this 
system must satisfy (where we have written ¢ for dt) for infinitesimal 
€, the equation, 


wette= fe EVO wy A (42) 
Let us substitute y = 7+ x in the integral; 
ae 2 
W(a,t+e) = [a Oye+noF. (43) 


Only values of 7 close to zero will contribute to the integral, be- 
cause, for small ¢, other values of 7 make the exponential oscillate so 
rapidly that there will arise little contribution to the integral. We are 
therefore led to expand v(x + 7,t) in a Taylor series around 7 = 0, 
obtaining, after rearranging the integral, 


SEV) aay 
W(a,t+e)= a [ct fue +e 


am_ 2 


Now, yes eh2eT -dn = 4/ Zelie! (see Pierces integral tables 487), and 


by differentiating both sides with respect to m, one may show 


i Wn? - ek3e" dy = Qrhei het 
m m 


—co 
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The integral with 7 in the integrand is zero since it is the integral of 
an odd function. Therefore, 


2rhei 
m ve het Ory 
= —=V(a) ee Ae : 2 
U(a,t+e) aoe {wen an Oe + terms in € eto 
(44) 


The left hand side of this, for very small ¢ approaches w(x, t) so that 


for the equality to hold we must choose, 


Ale) = ( (45) 


Expanding both sides of (44) in powers of € up to the first, we find, 


hie 0?» 


2m Ox2’ 


vet) +e = (e,t) — Ev(ayya,t) + 


and therefore, 


aoe = ea +V(x)y 
i Ot 2m Ox? 

which is just Schrodinger’s equation for the system in question. 

This confirms the remark that equation (41) is equivalent to 
Schrédinger’s differential equation for the wave function w.!? Thus 
given a classical system describable by a Lagrangian which is a func- 
tion of velocities and coordinates only, a quantum mechanical de- 
scription of an analogous system my be written down directly, with- 
out first working out a Hamiltonian. 


12 We have, of course only proved the equivalence in a very special case. It is apparent, 
however, that the proof may be readily extended to any Lagrangian which is a quadratic 
function of the velocities, in a addition to, perhaps, some terms linear in the velocity 
arising from magnetic fields. The equivalence will be shown in a more general manner 
later. 
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If the problem involves more than one particle, or particles with 
more than one degree of freedom, equation (41) remains formally 
the same, if g and @ are considered to represent entire sets of coor- 
dinates, and [ /g dq represents the volume integral over the space 
of these coordinates. The form of Schrédinger’s equation which will 
be arrived at will be definite and will not suffer from the type of am- 
biguity one finds if one tries to substitute Be for pq in the classical 
Hamiltonian. 

It will be important to consider the consequences of (41) when 
applied to connect the wave function at one time to its value at some 
finite later time. 

Suppose we know the wave function ~(qo, to) at time to, and desire 
it at time TJ’. Dividing the time interval up into a very large number 
of small time intervals to to t1, t1 to to,..., tm to T, and applying 
relation (41) to each time interval we obtain, if we write q; for the 
coordinate associated with time t;, 


EL (4 gist) -(tis1—-t) VIG) AG IG) dai 
w(qi+1, tit1) ~ fel (ft tj i gis) -( +1— - (qi, ti) ; 4 a 
AG = 
(46) 
Thus, by induction we find the relation, 


woana ff foo aS [e(28=8.0) toa 


A(t, — to) - A(tg —t1)-...- A(T — tm)’ 


BH 


x (qo, to) 


where in the sum we shall write Q for gm41, and T for t,41. In the 
limit as we take finer and finer subdivisions of the interval to to T 
and thus make an ever increasing number of successive integrations, 
the expression on the right side of (47) becomes equal to ~(Q,T). 
The sum in the exponential resembles ie L(q,q)dt with the integral 
written as a Riemann sum. 
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In a similar manner we can compute (qo, to) in terms of the wave 
function at the later time T’ = t41, by the equation; 


W* (qo, to) = ff fens tna) 


i< G+1 — Gi 
x exp {; oz [( $24 ass) “(ipa = “| 
V9m+1ddm4i + fig (48) 


A(tm+1 rs tra) eae! A(ty = to) 


2. The Calculation of Matrix Elements in the 
Language of a Lagrangian 


Suppose we wish to compute the average value of some function f(q) 
of the coordinates, at the time to, which we shall call (f(qo)); 


ich ‘| 18" (gosto) f (40) (40, to) «/G0dQ0 (49) 


Let us try to express this in terms of the wave function at some more 
future time, t = tm+41, by equation (48), and in terms of the wave 
function W(q_m,t—m’) at some earlier time t = t_,,, by an equation 
analogous to (47) (we shall let negative indices stand for times earlier 
than to). We obtain 


(#(00)) = fff 8° Canaastmss) 


x exp tj S~ E (= ass) (ti — ul - f (qo) 


: +1 
i=—m! 


VI ddm-+1 tee V9 dq0./9 dq-1 see Jgdq—m! 
A(tm+1 = tie) eas A(to = t-4) ots Altai = beh) : 
(50) 


W(q—m', tow) i 


We shall be dealing with expressions of this form to a considerable 
extent, so that we shall make some remarks concerning it. In the 
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first place, we will always take t_,, to be a fixed time 7; very far 
in the past, and t,,41 to be To, a time very far in the future. Sec- 
ondly, we may generalize somewhat by taking the wave function at 
T> arbitrarily, say y, and not necessarily the same as what the wave 
function w chosen at time TJ; would become at this later time TJ». 
In this way we shall have an expression more like a matrix element 
than an average. We shall use the symbol (y|f(qo)|w) to represent 
this quantity. We shall continue to call it simply the average of f 
even though it more closely resembles a matrix element. We may, 
of course calculate quantities such as (y|f(qi)|w), which correspond 
to the average value of f(q) at the slightly later time, t;, merely by 
replacing the f(qo) appearing in the integral on the right hand side 
of (50) with f(q.). Thus, we may find the time rate of change of the 
average value of f(q), as, 


(xlFlald) = OxlF(go)lY) _ Papacy = F(qo) 


d 
aH xlF@ly) = ftp am 


where the last expression means that we are to replace f(qo) in the 
integral of (50) by Ha) flao) We may say then, that the symbol 
(x| |W) means that the quantity inside is to be multiplied by an 
exponential of the form of equation (50), by a wave function x at 
To, and one, w, at T,, and integrated over all coordinates. Finally, 
the limit as the subdivisions of time become finer and finer is to be 
taken. We shall, for a while, disregard which wave functions are put 
into the expression, and shall simply write, (|f|), but we shall discuss 
this later. In this way we can define the average of F'(q(c)| where 
F is any functional! at all of g(a). We need merely to express the 
functional approximately as a function of the values, q;, of g at the 
points t;, place this function in the integral of (50) and pass to the 
limit. 


13 Properties of functionals and our notation with regard to them is described in the 
first section (p. 7) of the paper. 
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3. The Equations of Motion in Lagrangian Form 


Let us now consider some functional, which, expressed in terms of qj, 
the values of q at the times t; is F'(q;), - that is to say, a function of 


O(./gnF : 
-- + G-1590; 91; 92--- etc. Let us calculate (a aye) Replacing 
f by ae in (50) we see that we may integrate by parts with 


respect to gy. The integrated part may be assumed to vanish, for if 
we consider that the integrations over the other q’s were performed 
the remaining integrand would be similar to the square of the wave 
function at time t,, which presumably vanishes at infinity. We find 
after integrating by parts, an expression similar to (50) but having 
a different form for f(qo); namely we find, 


( 1 a fal *\ 


G4) Og 
=i (lB {S [2 (S80) tons). 


(52) 


Thus performing the indicated differentiation we obtain, 


(ka oP 


a dk+1 — Uk dk — G-1 
an he) fe | eae By AR pe ee: Ll 
“(| { [2A cen] (22 as) 
(tha) I, (£2 a) WW), (53) 
te — te-1 


where we have written Ly for the function oe, and Lg for gE. The 


expression in { } may be remembered most easily by noting that its 
limit for infinitesimal subdivisions is 


a (ab) at) , 
dt \ dq Oq ; 
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This relation (53) is fundamental in that, when compared to cor- 
responding expressions in the usual form of quantum mechanics, it 
contains, as we shall see, in one equation, both the equations of mo- 
tion and the commutation rules for p and gq. The way this comes 
about can be seen most clearly by applying equation (53) to the 
simple example with L = mi —V(z). It then reads, (,/g = 1) 


OF \\ _# | eee red facia a 
OxE hh thi1 — tr 
-m( BA) + —nav'@n)). oH 
th — te-1 
If F = xz then (54) becomes; 
(in) = p(n (SE) oy — ng +m (EAE 
h tet — te te —te-1 


3 


In the limit of fine subdivision of the time, since as t;, —t,_1 — 0 the 


+ xp (th — th_1)V" (xr) 


last term becomes unimportant, we can write, supposing as usual 
that in all these equations the limit is to be taken, 


Tk+1 ~ Vk LE Lk-1 h 
oe ee re - =e aa =-(|1]). (55 
( (m tei — tr )o vk (m ae ) ) “Al |) ( ) 


This is equivalent to the statement, in the ordinary notation of quan- 


tum mechanics, that the average value of pq — gp is equal to the 
average value of AT . The order of the factors in the usual mechanics 
here shows up as the order of the terms in time. (Exact relations by 
which formulas of the notation of equation (55) can be translated into 
relations in the more usual notation will be given in a later section 
(page 40)). 

Again by equation (53) for F = rz43, say, we find, 


Ce41—- 2 Le — Lp— . sans 
LEZ (maki Fe ) — Lk43 (m= 7 ) ) = 0 in the limit. 
tki1 — tr te — te-1 


(56) 


> 
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Thus, the difference of two successive momentum measurements 
followed by a position measurement, multiplied and averaged, is in- 
finitesimal since the two successive momentum measurements give 
the same result — but if the position measurement occurs between 
the momentum measurements, in time, the results is no longer small, 
the position measurement having disturbed the momentum between 
the times the momentum was measured. (Of course, these quanti- 
ties cannot actually be looked upon as averages of quantities in the 


classical sense because of the 7 in the expressions.) 
TkT7Tk-1 


th—-th-1 ) |) by its value a mo- 


In Eq. (55) we may replace (|ax-m( 
Tk+1—Tk 
tk4i—tr 
of the expression by a finite amount. Thus (55) may be rearranged 


ment later, namely, (|e44 -m( )|) without changing the value 


to read, 


(l(te41 — &%)*|) = — ts — th) - (|1]). (57) 


This describes the well known fact that a wave packet spreads 
parabolically in time from a point, and that although the average 
value of the displacement of a particle in the time dt is vdt, where v 
is the mean velocity, the mean value of the square of this displace- 
ment is not of order dt?, but only of order dt. We mention it here to 
point out that although the form for average velocity, is, from equa- 


tion (51), (|e |), the average kinetic energy, for example, must 


be written (| (GHP - SH!) |) rather than (| (HA=#)")), 


thai-tr th—te-1 te4i-tr 


The latter is infinite. 
If, in (54), we had chosen for F' the expression G12,G2, where 


G is any function of the coordinates, x;, belonging to times ¢; later 
than tz (t; > tp), and G2 is any function of the coordinates belonging 
to times earlier than t,, we would have found, in place of equation 
(55), the relation, 


Lkt+t— zx Lk — Lp 
(6: (mE) ae ae (maT) 
tii — tr te — tp 


= 5 (\GGal). (58) 
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This is equivalent to the usual relation among averages, 


(IG (p9 — ap)G2l) = + (IG1G2)). 


Since G; and Gp» are arbitrary functions of their coordinates, we 
may think of equation (58) as equivalent to the operator equation 
pq — ap = . 

Replacing F simply by G1G»2 with G',, G2 defined as before, equa- 
tion (54) becomes after dividing through by (t, — t,—1) since the left 
side is zero, 


Suda + V"(xx)| Ge ste HOH) 


This is equivalent to the operator equation, in the usual notation of 
quantum mechanics, which is the quantum analogue of Newton’s law 
of motion, namely, ma + V’(x) = 0. This law and the commutation 
rules are of course equivalent to the commutation rules and the rules 
HF -FH = Ap with H = 54p* + V(z) in the usual formulation. 
Thus equations (58) and (59) state all that is needed to completely 
solve the problem for this system, and hence equation (54), or its 
generalization, equation (53), from which they can be derived is all 
that is required. 

Even though we are going to extend this to problems for which 
no Hamiltonian exists, it may still be of interest to describe the 
Hamiltonian from our point of view, when one does exist. Let us 
consider the average of any functional F'(q;). To calculate the rate of 
change with the time of this quantity we may use a relation analogous 
o (51). Another method is as follows; suppose the variables g; which 
appear in F are limited to indices between the times t; and ty (J > 1’). 
That is to say, F is a functional of only the variables qj_1 to q@41. 
(In the limit as our subdivisions in time become infinite, 1,1’ may 
become infinite, but we want ¢; and tj; to remain bounded, so that 
F covers only a finite span of time). Now, if in our expression (50) 
for the average value of F' the values of the times t; for i between 
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1 and I’ were increased by a constant small amount 6 it would be 
equivalent to calculating our F at a time 6 later, with, however, 
the same wave functions kept fixed at fixed times, Ty and 7T,. This 
will give us just (|F|) + 64(|F|) to the first order in 6. Therefore, 
4 (|F'|) is the derivative with respect to 6 of the quantity we get by so 
augmenting the time variables. To compute the indicated quantities 
we look at equation (50) and notice that if all the times were altered 
as indicated, the only change made in the formula would be to replace 
ty — ty_1 by ty — tj_-1 — 6 and ty4, — ty by ty41 —ty +6. Doing this, 
and taking the derivative with respect to 6, we find, 


d a Ge G4 qi — U-1 
“rl —(\2,,(# 241 _ a te 
alin) = 7(|{ t( 2224 a) Se 


— gq h 
= 1($- a) polti ze nab 


44. — qu h 
-1( La.) + —a(ty44 -w)}) 5 (60) 
ty41 — ty i 


where a(dt) = a In A(dt). Equation (60) only applies if F’ contains 


only coordinates between q; and qj, and does not involve the time 
explicitly. If F’ does involve the time a term er (||) should be 
added to the right side. 

We may compare (60) to the usual relation 4(F) = +(HF-HF) 
and see that the analogue of H is the expression in the { } in this 
equation. The terms a(t, — tj_1) arise from the differentiation of 
the normalizing factor, A. They serve to keep the expression for H 
finite in the limit of infinitesimal subdivision, in spite of the fact that 


qd—WU—-1 


it may contain terms of the form ({( )*}) whose magnitude we 


t—ty_-1 
have already discussed. For example, in view of (57) for the simple 
Lagrangian mii” —V(z), it is seen that a must be CER and that, 
therefore, A(t; — t;-1) = const.,/t; — {_1, as we have already found 


for this case (Eq. (45)). 
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4. Translation to the Ordinary Notation of 
Quantum Mechanics 


What we have been doing so far is no more than to reexpress ordi- 
nary quantum mechanics in a somewhat different language. In the 
next few pages we shall require this altered language in order to de- 
scribe the generalization we are to make to systems without a simple 
Lagrangian function of coordinates and velocities. Before we do this, 
it is perhaps worth while to show how the relations we have derived 
up to this point, for systems possessing a Hamiltonian, H say, may 
be most readily translated into the more usual notation. 

The usual expression for the wave function at time t2 in terms of 
the wave function at time t, is given by means of the relation, 


Uty =, en tata, : (61) 


Thus the matrix element between a state which, at time T2, possesses 
the wave function x, and the state represented at time 7) by the wave 
function w, of an operator F considered acting at the time to is 


[ xttO-o pehlo—T Aya Vol : (62) 


Thus, for example, the average coordinate at time to between these 
states, which is what we have been calling (y|qo|w) is expressible as 


(xlaole) = f xk“ gekO-M yao! (63) 


Similarly, (x|qi|¥) is the matrix element of the operator 
eth (T-t)H ef H(u—T) 


, so that as t; — to — 0 the quantity, 


fi =ty 


(JB hu = [xetinm 
t; — to 


x en AT) Hyg Vo] 
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becomes [ yet rl (Te-t1) . + (Hq — qH)ek#—-TWasd Vol, and is there- 
fore the average value of the operator +(H q — qH), and represents 
velocity, as we have said. Other functional expressions can be made 
to correspond to operators. For example, if t,, > ti, 


(xlamaPlv) = f zeke tm) gen Htm—t) ex HiT) yd Vol (64) 


while if ty, < t; it is given by 


a 


(xldmaf |b) = / yer M(t) Per H(e-tm) gekH(im—T ahd Vol . 


(65) 
For example, equation (55) when translated becomes, if H = 
Imp’ + V(2), 


a 


[acto ; —{ (Hz —2H)x — «(Hx - vH) er! lte-T ad Vol 


i i, xen t(E—TMasd Vol 


a 


which, of course, is correct. The left side of equation (56) when 
translated with the aid of equations (64) and (65) does indeed vanish 
in the limit as the division of time into intervals becomes finer and 
finer. 

As a further example we may mention the equivalence between 
(\a(t) - f(x(t))|) and the average of 


h(t) | (f(x) + Fa)p)| eR) (66) 


as may be seen most easily by considering the average of 
Ui41-Li-1 
it1—ti-1 


and (65). 


(x;) according to obvious generalizations of equations (64) 


+ 
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5. The Generalization to any Action Function 


We now make the generalization to the case when the classical action 
need not be of the form & = f L(q,q)dt, but is some other more 
general functional of g(a). In equation (50), as has already been 
remarked, the phase of the exponential is just ; J L(q,q)dt written 
as a Riemann sum due to our subdivision of the time into finite, 
but small, intervals. The obvious suggestion is, then, to replace this 
exponent by 7 times the more general action. The action must of 
course first be expressed in an approximate way in terms of q;,t;, in 
such a way that as the subdivision becomes finer and finer it more 
nearly approaches the action expressed as a functional of q(t). 

In order to get a clearer idea of what this will lead to, let us choose 
a simple action function to keep in mind, for which no Hamiltonian 
exists. We may take, 


of = / * io ~ V(x(t)) + k2&(t)a(t + a} dt (67) 
—Co 

which is an approximate action function for a particle in a potential 

V(a) and which also interacts with itself in a mirror, by means of 

half advanced and half retarded waves, exactly as in (8). 

In the expression of equation (50) the integral of Z extends only 
over a finite time range T; to T2. Our action (67) is meaningless for 
a finite time range. In fact if we were to integrate over the range 
from T, to T2 the action might still depend on values of x(t) outside 
this range. 

This difficulty may be circumvented by altering our mechanical 
problem. We may assume that at a certain very large positive time 
T2, and at a large negative time Tj, all of the interactions (e.g., the 
charges) have gone to zero and the particles are just a set of free 
particles (or at least their motion is describable by a Lagrangian). 
We may then put wave functions, y and w, for these times, when 
the particles are free, into (50). (We might then suppose that the 
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motion in the actual problem may be a limit of the motion as these 
times T; and T> move out to infinity). We therefore compute, by 
analogy to (50) the quantity 


: i 
(xlF'b) = fx (dp, ) exp {polar 841 40, 4-15++-4y,)} 


F( V9 dqp, - - S914, 68 
“Fs 8 G15'G0-205) Wr AG) A ) 
In the limit as h — 0 this will lead to the classical action principle, 
d6@ = 0, in the way outlined by Dirac (see page 27) since nothing is 
altered to invalidate that argument. 
We may obtain the fundamental relation of our quantum mechan- 
ics, analogous to equation (52), by integrating the formula for the 


average of a by parts, to obtain, 
1 AV 9(4) F) i faked 
= —-(y|F - —|v). 69 
Ol Ferme WW = Rl BW) (68) 


As we have remarked above, this contains the analogue of the equa- 


tions of motion as well as the quantum commutation rules. 


6. Conservation of Energy. Constants of the Motion 


Because of the importance in ordinary quantum mechanics of op- 
erators which correspond to classical constants of motion, we shall 
mention briefly the analogue of these operators in our generalized 
formulation. Since these are not needed for the remainder of the 
paper, they have not been studied in detail. 

The notation will be as in the classical case described in section 
3, of the first part of the paper. The general discussion given there 
applies equally well in this case, so that we shall not repeat it. We 
will suppose, for simplicity that there is only one coordinate, q(c), 
instead of the N coordinates qn(o). 
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From the equations of motion (69) we can verify directly that, 


tj=te 
OD Z 1 ve, ree 


t;=te 


Puy p> woe (70) 


tj=t1 


If t; and ty are far apart, we can suppose that, as we have proved in 
the classical case, if the action is invariant with respect to the trans- 
formation gq — q+ ay, the expression on the left of (70) multiplying 
#F, can be expressed as a difference J;, — J7,, where the functional J;, 
involves the coordinates in the neighbourhood of tz and J;, involves, 
in the same way, coordinates for times in the neighbourhood of ¢}. 

If the expression, Y, involves coordinates for only a finite range 
around some time, to, and ¢; and ft, are outside this range (tz > 
to > t,) then the right side is independent of ¢; and tg. This is the 
analogue of the fact that I is a constant of the motion. In this case 
also, the left side becomes analogous to the quantity JF — FI, and 
the right side is just a differential operation performed on F’. The 
differential operation is characteristic of the group of transformations 
from which I is derived. 

Thus, for a displacement in the x direction, the differential oper- 
ator is #. the corresponding constant of the motion is momentum 
in the x direction, as we have the analogue of the operator equation 
Dyk — Fp, = hor More accurately, we have the analogue of the 
equation, ek A (t2- to), e7 FA (f2—to) po Fer ot) ye 7A toh) 
hor for all a tg. For the time-displacement the differential 
operator is a _ g and the constant is the negative of the energy, so 
that we have the analogue of HF’ — FH = oun — hor If F does 
not depend explicitly on the time, we can write the right side in the 
usual way, as Ap : 

For the energy expression, classically y(o) = g(a). The formula 
(60) can be deduced from (70) most simply by writing for y; the 
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form, y; = 5(e+ 44 = —=_. oh in the case that the coordinates are 


ti4 
rectangular. The method Ebel in connection with (60) (page 


39) of augmenting all the times between ¢; and fz by a fixed amount 
6 may be applied here also to obtain an alternative expression for 
the energy. 


7. The Role of the Wave Function 


The problem discussed in this section is that of the existence of a 
wave function for times between JT) and 7). 

It is to be noted, that in view of the relation (68) it is no longer 
possible to express the formula for the averages in some such simple 
form as { ¢3F¢d Vol. Suppose that F is especially simple so that 
it contains only the coordinate qo itself. (For example, perhaps we 
want to compute the average value of qo.) According to Eq. (68), it 
may be expressed in the form, 


: ROMaCn Meter (71) 


where p(qo) is the result of integrating the integrand of (68) with 
respect to every variable q; except go. This is to be compared to the 
usual expression when a Hamiltonian exists, 


/ #8(q0)F(40)61(40)/Go dao (72) 


They would be equivalent if (qo) could be expressed as the product 
of two functions ¢3(qo) and $1(go) in a natural and useful way. In 
general, however the integral of exp tA over all the variables except 
go cannot be so expressed. For the particular case that & is the 
integral of an ordinary Lagrangian function of velocity and position, 
the exponential can be broken up into two factors; 


do — q- — q-2 
L| ——— —t_ L| ———,, q_ 
expe ee ‘.): (to 1) + (2 ae aa :) 


(ta-ta)+.. 
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and, (73) 


an) +... 


These factors contain only the variable gg in common, so that when 
the integrations on the other variables are performed in expression 
(68), the result remains factorable. The quantity ¢1(qo) is the result 
of integrating the first factor, and is expressed in a form exactly 
similar to (47). The quantity $3(qo) is expressed in terms of y* 
by an equation like one obtains from (47) by taking the complex 
conjugate (i.e., (48)). 

We can take the viewpoint, then, that the wave function is just 
a mathematical construction, useful under certain particular con- 
ditions to analyze the problem presented by the more generalized 
quantum mechanical equations (68) and (69), but not generally ap- 
plicable. It is not unreasonable that it should be impossible to find 
a quantity like a wave function, which has the property of describing 
the state of the system at one moment, and from which the state at 
other moments may be derived. In the more complicated mechani- 
cal systems (e.g., the example, (67)) the state of motion of a system 
at a particular time is not enough to determine in a simple manner 
the way that the system will change in time. It is also necessary to 
know the behaviour of the system at other times; information which 
a wave function is not designed to furnish. An interesting, and at 
present unsolved, question is whether there exists a quantity analo- 
gous to a wave function for these more general systems, and which 
reduces to the ordinary wave function in the case that the action is 
the integral of a Lagrangian. That such exist is, of course, not at 
all necessary. Quantum mechanics can be worked entirely without a 
wave function, by speaking of matrices and expectation values only. 
In practice, however, the wave function is a great convenience, and 
dominates most of our thought in quantum mechanics. For this rea- 
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son we shall find it especially convenient, in interpreting the physical 
meaning of the theory, to assume our mechanical systems is such 
that, no matter how complex between the time 7) and 7», outside 
of this range the action is the integral of a Lagrangian. In this way 
we may speak of the state of the system at times 7, and 7%, at least, 
and represent it by a wave function. This will enable us to describe 
the meaning of the new generalization in terms with which we are 
already familiar. This we do in the next section. 


8. Transition Probabilities 


We shall suppose, as suggested above, that our action has the form 
of the integral of a Lagrangian for times 7) or later, and for times 
T, or earlier, but that it is arbitrary in between. In this way, we 
may speak of the state of the system at time 7) as being given by 
a wave function w, and of the state of the system at time 72, by a 
wave function y. We can then make the physical assumption that 
the probability that the system in state w at time T, will be found, 
at the time T», in the state y is the square of the absolute value of 
the quantity (x|1|w). The quantity may be defined by the expression 
(68) with F' replaced simply by unity. 

We can define other physical quantities in terms of this, by de- 
termining the changes in this probability, or rather in the quantity 
(x|1]w), produced by perturbations of the motion. 

We shall indicate by a subscipt the action for which the quantity 
(x|1]) is calculated, by writing (y|1|v) ~ if the action is <. Suppose 
the action is slightly altered (in the interval T, to T2) to become 
A+eFf where € isa very small parameter. From the form of equation 
(68) we would have, 


(x|1]h) orrex = (xle™? |W) (74) 


so that if € is small enough to insure convergence, we may write, 


2 


a aye XIF Wr =e 


(75) 


OID) ober = Old) + SOF IV) 
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We can therefore interprete (y|¥|w),.y by saying it is 
24 ((x|L |b) +eF) at € = 0. It should be emphasized here that 
(x|FGY|v)~ where F and Y are any two functionals cannot in 
general be written in a way analogous to a matrix product (e.g., 
om (X1F |lom) (Om|Y|v)) as can be done in the usual mechanics. 
(This is because ¥ and Y may overlap in time, and so neither be 
before the other.) The term (x|.FY|wv) .~ may either be interpreted 
as (x|-F |W) ~ with ¥ replaced by ¥Y, or alternatively, as the first 
order change in (x|.F|wW).v on changing & to A +eF (see equation 
(76)). 

We have, incidently, derived a perturbation equation (75), which 
may be easily generalized (change & to # +7 in (74), differentiate 
both sides with respect to y, and set y = 0) to read, 


e 

(XIFb) tey = (XIF|Y) raed = (xI Fb) at — 55 sa (XI FPG) + 

(76) 
This permits us to express the average of a functional for one ac- 
tion function in terms of averages of other functionals for a slightly 
different action. For some particular problems, such as, for exam- 
ple, the electrodynamic one, it may turn out that the action may be 
considered as the sum of two terms, —% + &, the first expressible 
as the integral of a Lagrangian, while the second, not so express- 
ible, may be considered as a small perturbation. Equation (76) then 
permits the actual matrix elements to be expressed in terms of the 
matrix elements with the Lagrangian action, ~, alone. Since for 
this action, .%, the problem is comparatively simple because wave 
functions can be defined, the relation (76) will serve as a practical 
method for solving problems in these cases. 

Perturbations may also be considered as producing transitions. 
Suppose the state of the system at the early time 7) was w. Let 
us choose some state vy, at the time 75, and ask for the probability, 
with the perturbed action # + ¢F¥, that the system will be found 
in this state at this time. It is just |(x|) ~+eg|°. We shall further 
suppose that x is so chosen that were it not for the perturbation, 
e¥, the system would have no chance of being found in the state y; 
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that is, (x|1|),y = 0. Therefore from equation (75), to the order €? 
the probability that the system originally in state w, is found in the 
state xv at that time T2, due to the perturbation — i.e., the transition 
probability is just, 


arlxleFW a. (77) 


For the special case of a simple perturbing potential acting for a 
time 0 to T, we have ¢¥ = — i Vadt, so that our transition prob- 
ability becomes the more usual expression (Compare, Dirac, “The 
Principles of Quantum Mechanics”, p. 177, (Eq. (20)) 


2 


T 
bd i Vat |) of 


1 
Fi 

It is of interest to notice that if we are given that the wave function 
at time 7) is wy,, although we cannot trace the behaviour of the 
function through the interval T; to Ty) we can, nevertheless, answer 
the question, “What will the wave function be at the time 7, and 
later times?” (Of course, if we know the wave function at the time T) 
we can find it later, since it satisfies a Shrodinger equation from that 
time on.) If we call the wave function at time T», ~p,, and expand it 
in terms of a complete set of orthonormal wave functions x, at that 
time, say pr, = >> AnXn, the coefficients a, will be just (xn|1|W7,) ~- 
Therefore, we have, 


br, = >) Xn(Xnlllbr a - 


On account of the form (68) for calculating (yn|1\w7,).<7 we find that 
this may also be expressed as, 


V7, (Q) = [ex {51 Qdmar-- >90,9-1;--- an) 


V94Gm-1 +++ G44 p, 


V7, (97,)° =a (78) 


in the limit when the subdivisions in time become infinitely fine, 
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where &(Q---qr,) is that part of the action functional applicable 
for times from T, to T2 (compare (47)). 


9. Expectation Values for Observables 


The physical interpretation which is given in the above section, al- 
though the only consistent one available, is rather unsatisfactory. 
This is because the interpretation requires the concept of states rep- 
resentable by a wave function, while we have pointed out that such 
a representation is in general impossible. We are therefore forced to 
alter our mechanical problem so that the action has a simple form at 
large future and past times, so that we may speak of a wave function 
at these times, at least. This difficulty also finds itself reflected in the 
mathematical formulation of the operations performed in calculating 
average values from the equation (68). We have not defined precisely 
what is to be done when the action does not become simple at times 
far from the present. 

One possibility that suggests itself is to devise some sort of lim- 
iting process so that the interpretation of the last section could be 
used, and the limit taken as T; — —oo and Th — +oo. The au- 
thor has made several attempts in this direction but they all appear 
artificial, having mathematical, rather than physical, content. 

An alternative possibility is to avoid the mention of wave func- 
tions altogether, and use, as the fundamental physical concept, the 
expectation value of a quantity, rather than a transition probabil- 
ity. The work done in this connection, which is presented in this 
section, is admittedly very incomplete and the results tentative. It 
is included because many of the formulas derived would seem to be 
of value, and the author believes that the solution to the problem of 
physical interpretation will lie somewhere in this direction. 

In ordinary quantum mechanics the matrix element of an operator 


A, between two states w, and wy, is given by, 


y ems : ax Adin Vol . 
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The expected value for the quantity represented by the operator 
A, for the state represented by the wave function wy, is, Ann = 
J ~* Aw, d Vol. In an exactly similar way, we have used our definition 
(68) for a matrix element of a functional F’ between the state whose 
wave function at the time 75 is y, and state whose wave function at 
the time 7; is ~. To compute the expected value of the functional 
whose wave function at the time T is wr,, we calculate (w7,|F|v7,), 
where wr, is given in terms of wr, by the equation (78). 

Another important quantity in quantum mechanics is the trace of 
a matrix,/* Tr[A] = >> 
ized) average expected value when a priori each state w, is consid- 


alln Ann- It measures the relative (unnormal- 
ered as equally likely. We shall speak of it simply is the expectation 
of A. 

Let us suppose A is an operator which has only certain particular 
eigenvalues a, so that Ayn = GnXn, for some set of function y,. Let 
us also suppose Fy, (a) is a function of x which is zero unless x = an, 
and Fa, (an) = 1. Then let us find the trace, Tr[BFy,,(A)]. (Fa, (A) 
is a projection operator). The matrix B- F,,,(A) has for its k, | 
element, in a representation with the functions xn, [BFu, (A)|m = 
Vein Brm|[Fa,(A)]mi- But, since A is diagonal in this representation, 
[Fan (A)|mi = 0 if m € 1, and equals Fo, (am) otherwise. Therefore 
[B- Fu, (Ala = BriFa, (a1). Now, Fa,, (a) = 0, unless 1 = m so that 
we find, Tr[BF,,,(A)] = Bmm. That is to say, the trace of BFy,, (A) 
is the expected value of B for the state for which the quantity A has 
the value am. 

In a like manner it is not hard to show that, Tr[F%,, (B) - Fa,, (A)] 
is equal to the probability that the quantity B will be found to have 
the value b, in a state where it is known that A has the value a,, 
(neglecting degeneracies). 

These examples are given to remind the reader of the fact that by 
means of the concept of trace all of the important physical concepts 


14 See J. von Neumann, “Mathematische Grundlage der Quantenmechanik” (1932), 
p. 93, ff. 
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can be derived. What corresponds to taking a trace in our form of 
quantum mechanics? 

As we can see from equations (68) and (78) the expression for 
(Urs|Flwr,) can be written in the form, f p(q.q/)Wn (q)wrr, (q))dada! 
where p(q, q’) is given by a complicated expression obtained by substi- 
tuting (78) into (68). That is to say, what corresponds to the diagonal 
element Ay, may be written in our case as f p(q,q/)¥%(q)Wn(q')dadd’. 
The sum of the diagonal elements, and therefore the trace, corre- 
sponds to 


YS [ola weavn( al). 


As is well known, however, the sum over all n of w*(q)Wn(q’) is just 
6(q — q’), so that what corresponds to the trace of F is f p(q,q)dq. 
We are therefore led to consider the quantity, 


a 
Tr(F) = fe {-F Mtryooas-- Mb mreio tr Ip 


a 
x exp {Fp sdns ton! sisde,]} 


x F (22. Q13G0025).4 \/ Ir, dp, ‘: VJ/Ir, dd, 


VG Mm VIET ans, VGA "+ VEEI—-m'+1 (7g) 
A*.-- A* A-A 
The passage to the limit of infinitely fine subdivisions is implied, as 
usual. Since now there are no wave functions, and therefore nothing 
special about the times T) and 75, we can consider the true trace to be 
the limit of the Tr(.F), defined in (79), for a sequence of mechanical 
systems each of which has an action identical to the true one for 
successively longer time intervals. (The problem of convergence is 
ever present.) 
The trace defined in (79) is identical to the usual trace of quantum 
mechanics if ¥F is a function of one coordinate (e.g., go) only, and 
the action is the time integral of a Lagrangian. It lacks, however, in 
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the general case, one important property, and it is this that makes 
the results of this section so uncertain. The trace of an arbitrary 
functional is not always a real number! 

We lack some condition on the functionals which we are to place 
into (79), in order to obtain a real value, so that we can say that the 
functional represents some real observable quantity, the expectation 
of which is the trace. That is to say, we lack the condition on a 
functional that it represent an observable, analogous to the condition 
in ordinary quantum mechanics that an operator, to represent an 
observable, must be Hermitian. The correct criterion is not known 
to the author. The most obvious suggestions revolve around the 
generalization to (79) which is obtained by letting ¥ be a function 
of the q’ variables, as well as of g. A real trace is obtained if F 
is any function symmetrical with respect to interchange of each q 
with the corresponding q’ (ie., if F(...q1,40,---3---4590)---) = 
F(...G4,9)-+-3-+- 4,90). For example ¥ might be a function of 
3 (4; +4) only. This symmetry condition may be all that is necessary 
to insure that the functional correspond to a real observable. The 
product, and the sum of two such symmetrical functionals is again 
symmetrical. 

We pass from the general problem of a criterion for any functional 
to that of trying to determine the form of certain special functionals 
which we should like to identify with special observables (in partic- 
ular, projection operators). Let us first try to find the F that we 
are to place in (79) so that the resulting trace is the probability that 
q at the time fg (i.e., g(t2)) has the value 6 if it is known that q at 
the time ft; has the value a. If is the integral of a Lagrangian the 
answer is simply ¥ = 6(qz, — b)d(qz, — a), as can be immediately 
verified. On the other hand, I have not succeeded in prove that the 
trace of this quantity is in general real. The trace of, 


Gig + Gy Ga tae: 
(at b]-6 Trager 
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is, however, real and gives the same value to the desired probability 
if the action is an integral of a Lagrangian. 
We may therefore tentatively assume that, 


7 + gt 7 +1 


gives the relative probability that, if qg has the value a at the 
time ¢,, a measurement of g at the time fg will lead to b, in the 
range db. (The absolute probability may be gotten by dividing by 


Tr (6(e% — 4). 
-») 3 (BAe) a 


We shall likewise assume that 
E 2 2 
gives the relative probability that, if g has the value a at the time ¢1, 


Tr (° (3 [ass Bs decks = vip + Ge 


a measurement of velocity at the time tz will lead to v in the range 
dv. That this gives the correct answer, in the case of a Lagrangian 
action which involves the coordinate q in the kinetic energy term as 
amq’, is shown below. 

In a similar way we can define the probabilities for any quantities 
involving linear combinations of the coordinates (and that includes 
velocities, accelerations, etc.). For example, the probability that the 
difference in the coordinate qg at the time ¢3 and its value at the time 
tz is between b and b+ db, when it is known that at the time ¢; the 
velocity plus c times the position is a, is given by the trace of, 


G+, Hs +4, 
(pe 
2 2 


eG Gite + Gre Gt, 
2€ Qe 


+ c(qz, + G&,) — : db. 


(This has been checked for a free harmonic oscillator). It is possible 
that this is even true if f3 is before t,, and ft after t1. 
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where Y involves times earlier than tz does agree with the usual form 


We shall now show that 


mfg {2 | Setet Vinte Vin + Gy 
E 2 2 


for finding the probability of a given momentum, mv, if the action 
is the integral of a Lagrangian, say sm? — V(q). In the expression 
(79) the integral on gp, can be immediately performed, inasmuch as, 
(see (45)) 


ee le) Ve), -# le (EE) ve,)] 


Thus the integral on g/, means merely replacing q/,, by dm. We have 
thus again the same expression as (79) with one term integrated 
from the end. We can repeat this process many times, telescoping 
the Lagrangians, until we come to the term q, +¢. (That is to say, 
we could have taken T2 equal to tg + € without loss of generality). 
Let us suppose also that we have integrated all variables following 
tz and qj, and the net result is p(qi.,q,). (It can be expressed in 
this form for any Y satisfying our condition in virtue of the form of 
the action.) That is to say, we must calculate, 
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The phases of the exponentials, when combined, reduce simply to 


; 2 ! 2 
ve ; iy Qtote — Ute _ tote — Fto 
— E E ; 


The integral on the 6 function, over q;.+<, requires that one substitute 

é . +41. : . 
for this quantity q+. = aye + ve, and multiply by «, to obtain, 
in the final result, 


are 4 EY gf 
|e e hat een 42 o( Gta: dy) 1Gty dq}, z 


This agrees with the usual expression for the probability of a given 
momentum p, if p= mv. (The extra factor of normalization, comes 
from the fact that dp = mdv.) 


10. Application to the Forced Harmonic Oscillator 


As a special problem, because we shall need the results in the next 
section, we consider, from the point of view of the modified quantum 
mechanics, the problem of the forced harmonic oscillator; that is, an 
oscillator interacting with another system. This problem, when the 
oscillator is interacting with a Lagrangian system, can of course be 
handled by the usual methods of quantum mechanics. We shall see, 
however, that the added power of looking at all the times at once, so 
to speak, which arises in such equations as (68), has some advantages. 
With a wave function, the oscillator and the interacting system are 
so firmly interlocked, mathematically, that it is hard to study the 
properties of the oscillator without, at the same time, solving for the 
motion of the interacting system. We shall be able here, however, to 
solve that half of the problem which involves the oscillator, without 
solving the entire problem. 
If x is the coordinate of our oscillator, the action is of the form, 


2 


ma mwa 
2 2 


+ eh : (80) 
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where ™ is the action of the other particles of the system of which 
the oscillator is a part, and 7(t)- a is the interaction of the oscillator 
with the rest of the system. Thus, if we symbolize the coordinates of 
the rest of the system by Q, y(t) is some functional of Q. (We might 
also contemplate that the oscillator is not interacting with any other 
quantum mechanical system, but is simply acted on by a forcing 
potential. In this case, y(t) is a simple function of t, and represents 
the force acting on the oscillator at time t.) We shall suppose that 
y(t) is zero for times outside the range 0 to T, and shall compute 
matrix elements of the form (x7|1|Wo).v where wo is a wave function 
at time t = 0, (it involves Q as well as x), and yr is a wave function 
at time t = T. Writing this in more detail, by equation (68) it is, 


m—-1 


a 


(xrltlv)r =f xr(Qn.ctm) exp Fy whl Qj] + 


i=0 

ye ees ee ala at 

nears 5) Vivi i+1 i 
JIdQm ++ /GIQo 
x 1 
¥0(Qo, Xo) Aad, 

a ce i ed (81) 

BEEP (ti, — trn—1) +++ 4/ 2 (t — to) 


where the A,, are the normalizing constants appropriate to the action 
&\Q;|, and Q; is the variable Q at time ¢;, x; the variable x at the 
time t;, and 7; = y(t;), a function, perhaps, of the Q;. We are to set 
tm = T and to = 0. 

Inasmuch as the integrand is a quadratic function of the x; (for 
i #0 or m) we may actually perform the integration over these 2;, 
and leave the integrations on Q,; to be performed later. We will 
simplify the work by taking all the intervals, t;4, — t;, equal, and 
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equal to «. We are therefore led to consider the quantity, 


2 
xy 
ctemouts= ow [f-~ fof che 


me>T 


2,2 
_ mux iad 1] }—ne d&@m—1 ++: dry (82) 


2 / 2rehi / 2nshi 


We shall perform the integrations one after the other, starting 
with x1, then x2, etc. We shall determine the result by a recursive 
method. We can guess, that after the integrations from x, to x; —1 
have been performed, the integrand will depend, except for a factor 
involving other x’s, quadratically on the variable x;, according to the 
following form; 


dx; 
Qnhei 
m 


where A,, 6;, 7; are constants, independent of x;, 7441, etc., which are 


Azer (iti tBivivit1 + divi tm) é (83) 


to be found. The integral on x; may now be performed, by writing 
the exponent as, 


rer Biwigr +6]? i (Gitin1 +6)? 4 
m 2a; h 4a; h 


changing the variable x; to ap ee and using [ ery dy = Vini. 


We obtain —4+— exp + (ni — Bias) Multiplying this by the 
vf 2ea;/m h 4a; 
term 
i (2mx? M@a1Lieg  mwe 


giving the part of the exponential in (82) depending on x;+1, we find 
that after the integration on x; is performed the dependence of the 
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integral on x;41 is, 


A; tf 2Mei., Mayivigg mMwEe 9 
\/2ea,;/m € E 
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This is again of the form (83), so that our guess is self-consistent if 


m 
He he, 
~ 2EQ; (84) 


we set, 


2 2 
mB mute 
+1 = Z Hee: 2? (85) 
m 
fu =-™, (86) 
26; 3; 
dina = EV%i41 — ae (87) 
V 
F 
M+1 = 1h — Hee (88) 


We note, therefore, 3; is a constant —. We shall solve the other 


equations in the limit « — 0 under the assumption (which will be 


self-consistent, and is therefore correct) that a; — 32, 6;, ni, Ai are 


all finite. 
Replacing (86) in (85) and setting 


a= —+A, (89) 
find, 


Qe 4e2x(B+r) 2 


Expanding the fraction TEx by the series 1 — 2€ yj — Ne and 


keeping no further terms, find, A;41 — A; = 2e \2 re? Ase — 0, 
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then, A; may be considered as a function of t;, so that dividing both 
sides by ¢, in the limit we may write, 


dx Dei ani 
This has the solution 
i = cot w(t + const.) . (91) 


Since, for small ¢, (e.g., t =¢), ais 4, A must approach #. This it 
does if the const. is zero in (91). Therefore, 


m m mw 
pilserey yore ke eae a 92 
OA + De + 5 cot (92) 
Place this in (84), yielding, Aji1 = area = A; (1 — ¥ cot wt) 
for small ¢. In the limit then, aA = —A-$cotwt. This implies 


— _const. : P i 
A= Jamar” Since for t of the order ¢, A is Jia the constant 


must be \/54% so that we find, 


/ mu 
A=,/—————_.. 
2rihsin wt (28) 


Putting our values of a and (3 into (87), find, 


mo; 


io = ee 
we wee 2e(# + %& cot wt) 


which leads to the differential equation a =7y-06-wecotwt. This 
equation has the general solution 
1 t const. 
é6=- [ ¥(s) sinwsds + — : 
sinwt Jo sin wt 


Since, for small values of t, of order ¢, 6 approaches — x9, constant 
here is —mwag. Therefore, 


MWXO 1 


t 
6=— + i 7(s) sinwsds. (94) 
0 


sin wt sin wt 
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Replacing a by its leading term, 3+, in Eq. (89) we obtain, in the 
limit, the equation, a = ae Thus, we find, 
* [@/?? 
=-— | —d 95 
n= f (95) 


and is subject to the condition that as t — ¢, 7 —- eae. We may 
now utilize these results to compute Gy(2%m, 20; T). 
Since it is clear from its mode of formation that, 


ek (etm —m— Grae to L) 
= A ek Om ®in+8m&m%m+1+5mzm+Nm) 
we may calculate G in the limit as « — 0, me — T, from our ex- 
pressions (92), (86), (94), (95) and (93). After a little algebraical 


rearrangement, it may be written in the rather convenient form (we 
have replaced x», by x, and x by 2’), 


Dp = = bee uf ; t 
Gy (ax, 2"; ) Go(x a,x b; ) OD Sok 
T et 
“ | | v(s)n(t)sinwo(t~ s)dsde + m2usinwT ab}, 
0 JO 
(96) 
where 
1 T 
a= aor | ¥(t) cos wtdt (97) 
and 
1 T 
(as ——_ | y(t) cos w(T — t)dt (98) 
mu sinwT Jo 


and Go(y, y’;T) is the value to which G,(y, y’;T) reduces when y = 
0, and is the well known generating function for the unperturbed 
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harmonic oscillator; 


CG an / mw mw 
oy 957) 2rihsin wt ae 2hsinwT 
x {(y* + (y')?) cos wT — 2yy"}. (99) 


Going back to equation (81), we may now express the average in 
question by the simplified expression, 


(xr|1|vo)aw = fr Qn,apetel-Oe 


Eis Teej ane ae ee 
Am**: Aj 
(100) 


We shall need this formula in the next section. It is the analogue, in 
a sense, of the solution (26), (27), (28) of the equations of motion for 
the oscillator, in the classical case. Here, as there, the solution for 
the motion of the oscillator is expressed in terms of the interacting 
system, without it actually being necessary to solve for the motion 
of this interacting system. 


11. Particles Interacting through an Intermediate 
Oscillator 


The problem which we discuss in this section is the quantum ana- 
logue of the problem discussed in Section 4 of the first part of the 
paper. Given two atoms A and B, each of which interacts with an 
oscillator O, to which extent can the motion of the oscillator be dis- 
regarded and the atoms be considered as interacting directly? This 
problem has been solved in a special case by Fermi! who has shown 
that the oscillators of the electromagnetic field which represent lon- 
gitudinal waves could be eliminated from the Hamiltonian, provided 
an additional term be added representing instantaneous Coulomb 


15 E. Fermi, Rev. Mod. Phys. 4, (1932), p. 131. 
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interactions between the particles. Our problem is analogous to his 
except that in the general case, as we can see from the classical ana- 
logue, we shall expect that the interactions will not be instantaneous, 
and hence not expressible in Hamiltonian form. 

Drawing on the classical analogue we shall expect that the sys- 
tem with the oscillator is not equivalent to the system without the 
oscillator for all possible motions of the oscillator, but only for those 
for which some property (e.g., the initial and final position) of the 
oscillator is fixed. These properties, in the cases discussed, are not 
properties of the system at just one time, so we will not expect to 
find the equivalence simply by specifying the state of the oscillator 
at a certain time, by means of a particular wave function. It is for 
this reason that the ordinary methods of quantum mechanics do not 
suffice to solve this problem. 

The natural question to ask, working from analogy with the clas- 
sical system, is: “What is the expected value of .¥, a functional in- 
volving the particles only, if it is known that at time 0, the position of 
the oscillator was 7(0) = a, and that at the time T it was x(T) = 3?” 
If we can show that the answer to this question is, with a given fixed 
a and 3, the same as the expectation of ¥ calculated by a formula of 
exactly the form (79) for an action principle involving the particles 
alone, then we shall have found the conditions under which a direct 
interaction can be represented as acting through an intermediate 
oscillator. 

That is, we should like to satisfy, (see p. 53ff), 


/ / 
mr (#-5(=2 52 - 5) 5(2EAt — a) ) 
2 2 A,B,&O 


= const. - Tr(F) 4B, (101) 


where the trace on the left is computed for the action of the particles 
and the oscillator, and that on the right only involves the particles. 
(The constant appears because we are interested only in relative 
expectations, and can normalize the trace later.) 
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We shall simplify matters by supposing the particles are repre- 
sented by a set of coordinates, which we symbolize by Q, that the 
action for the particles is @, that the oscillator has the coordinate 
x(t), with the Lagrangian tg? - Buen, and that the Lagrangian of 
the interaction is y(t)x(t) where y(t) is some functional of Q(t). Let 
the action for the particles with the oscillator eliminated be 2%+-%, 
where .¥%, a functional of Q only, the action of interaction, is to be 
found to satisfy (101). 

If (101) is to be satisfied for all arbitrary functionals, F, we must 
have, on account of (79), 


a ss Tm 
[ex = |e + i: y(t)a(t)dt + a —(¢? — wet)a 
a - Tm 
x exp —F on + | 9 (t)a'(t)dt + i ae _ a 
0 0 


/ / 
5 (2 a 6)o(2 zs a) dxpdxy 
2 2 
dx---dx dz'-+-dz' 


ae eae ie RCPS: 


=const. f exp tlh + ]-exp—T lo + ¥1 


gp + xp ro + dx-+-dxdz'--+ da! 
«6( AAP — a) 6( A ol ii aaa Oar aaa 
(102) 


where %, 7, and -¥' are the same functionals but of a different 
variable (namely, Q’) than </, 7, and .%, and the integrations over 


the x’s are to be performed as prescribed in detail in (79).1° We 
A—Fs) 


can divide out the ex from each side, of course. Our main 
problem is to show that the left side, when integrated over all the 
coordinates of the oscillator, can be expressed as a product of an 


exponential function of Q’ only with an exponential of minus the 


16 Editor’s note: The right-hand side of (102) is missing in the original typescript of the 
thesis. Its form has been inferred by the editor from the surrounding text. 
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same function of Q only. This will be by no means generally true, 
and is a consequence of our special choice of 6 functions on the left 
of (102). 

Since as far as the variables x are concerned, the action is La- 
grangian, the integration over all x and 2’ from x7, to xp can be 
immediately performed in the way indicated at the end of Section 9 
(see page 55). In a like manner the integrations from x7, to x9 may 
be performed. The integrations on the a-primes from x, to ap_. 
can next be performed according to the method of the last section, 
and they yield G*,(xr,x9;T). Those on the corresponding interme- 
diate x’s result in Gy(x7,29;T). Thus we must prove that, 


/ d(xr — B)d(xo — a)G>, (rr, %0;T)Gy(xr, xo; T)dxrdxo 


ig 


= const.e7t” et . (103) 


The left side is simply Gy({,a;T)Gy(8,a;T), and by substitut- 
ing G, a in the formulas (96)—(99) we can find that this left side is of 


the form of the right side, if we take const. = ;7*—7 and % equal 
to 
g = MECN (G— 0)? + (a a)?] - (9 naa) 
2 sinwT 


+ | | ¥(t)y(s) sin w(t — s)dtds + m® sinwT + ab 
2mw 0 0 2 


with a and 6 as in (97) and (98). -¥’ is then the corresponding thing 
with 7’ replacing y everywhere. After a little rearranging, we get 


we ie a sin w( SEES Eee ya 


sinwT 


= a i sin w(T — t) sinwsy(s)y(t)dsdt 
mw sinwT 0 


mw cot wT (2 +02] — 


This is identical to the expression (35) obtained for the action in the 
corresponding classical problem in which the initial value of x is held 


Ba. (104) 


sinwT 
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at x(0) = a, and the final value of «(T) = @. The added constant 
term has, of course, no significance. (It will cancel a corresponding 
term in -¥’.) 

Thus, the particles with the action of interaction .4%7, may be re- 
placed by a system with an intermediate oscillator, provided that, 
in calculating the expectation of any functional of the particles, it is 
calculated under the conditions that the oscillator’s initial position 
is known to be a and its final position is known to be @. It is to 
be noted that we have not proved that, in general, the system with 
the oscillator is equivalent to one without, for that is not true. The 
equivalence only holds if the oscillator is known to satisfy certain 
condition. 

For the other example, (28), of conditions leading to an action 
principle, which we have considered in the classical case, we are led, 
here, to ask the question: “What is the expected value of ¥ if it is 
known that for the oscillator, 


sinwT 


Z 0) + 2(T)coswT — «(T) = Ro 


2 
and that, 5[a(T) + 2(0) coswT + (0) S23] = Rr?” 
To answer this question we must try to satisfy an equation anal- 
ogous to (102) but with §(*2 )) . 5 (eto - a) replaced by, 


1 
5(5{ + 2 + (27 + 2p) cos wT 


x + a —(¢, +2 
_ @rte + rye) — (er + 27) sinuit} Ro) 
EW 


1 
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In this case the integrations on x and 2’ can only be performed 
from x7, to x7+._, leaving an extra Lagrangian factor, and those from 
x7, can proceed only to x_;-. The integrations between xo and xp 
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can be performed as before, giving G functions. The result is that 
we must find an -¥ to satisfy: 


: phy 2 2 
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Since the integration of this complicated expression is perfectly 
straightforward, we shall not include it here. (It is best to integrate 
with respect to dxr,- and dx_, first. The term e! 3 = (@0)” ef + x 
can be neglected because it is of no importance in the limit as ¢ — 0.) 
The result is that the left side may be made equal to the right side 


by choosing the constant as 2p hats and taking -% to be 


T : = ; 
P= ih Rosinw(T —t) + Rrsinwt -y(t)dt 
0 


sinwT 


T t 
tf [ simwte—syryn(sjatas, (107) 


with a similar expression for .%', obtained by replacing y by y’. This 
is again in agreement with the classical result. 

To answer the question: “What is the expectation of ¥ if it is 
known that initially the position of the oscillator is w, and its velocity 
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is v?”, we must try to satisfy an equation analogous to (102), but 
with 5 (Ar — B) 6 (=F — a) replaced by 


ro + 2 To+@ etal, 
Ff OEE 0 ap gp Or MO =e ee 
( 2 w) ( 2e Qe . 


If this is done, however, and the integrations are carried out, the left 
side of (102) becomes 


. F 
i / ; 
aR OP amc y. 2(7(t) — ¥ (t)) (mv sin wt + ww cos wt)dt 


T t 
ep i, i (o(8) = OVE) +44())sinwt — ath (108) 


It is seen that (102) cannot be satisfied by any choice of .%, and 
there now appear in the exponential (108) cross terms between 
and y’, such as y(t)7(s). This corresponds to the classical finding, 
that no action exists in case the initial position and velocity are held 
constant. 

These results serve as a confirmation of our formal generalization 
to systems without a Hamiltonian. They have obvious application 
to electrodynamics into which we will, however, not go here. 

We should like to make a remark, before closing this section, about 
equation (108). Even though it does not lead to a system which 
can be expressed by a quantum mechanical least action principle, 
it is nevertheless correct, of course, that to find the average of the 
functional * we must multiply ¥ by the expression (108) times 
exp ¢(% — %) and integrate over all the Q and Q’. That is to 
say, the expected value of ¥ for this system is obtained in a way 
analogous to (79), except that the phase of the exponential, which in 
(79) is of the form +{2[Q] — #[Q']} is now of the form + A(Q,Q’), 
where the quantity Z involves the expression in (108). 

What should be the behaviour of the system, described in this 
way, in the classical limit as h — 0? Dirac’s argument (see page 27) 
in this case leads to the conclusion that only those values of Q, and 
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of Q’, will be of importance which satisfy both, 


BQQ) 9 nq 22@2) _ 9 


6Q(t) 6Q'(t) 

Inasmuch as 4(Q,Q’) = —B(Q’,Q), the second of these equa- 
tions results from the first by interchange of the Q and Q’. There- 
fore one solution of these equations would have the property that 
Q(t) = Q'(t) where Q(t) satisfies, 


5B(Q,Q’) —(). (109) 


824) law=ow 
With & as given by (108), this leads immediately to the classical 
equation got by substituting for x(t) from (26) in (24), and replacing 
x(0) by w, and «(0) by v. This suggests a way of quantizing systems 
which classically do not satisfy a simple principle of least action, but 
we shall not investigate this here. 


12. Conclusion 


We have presented, in the foregoing pages, a generalization of quan- 
tum mechanics applicable to a system whose classical analogue is de- 
scribed by a principle of least action. It is important to emphasize, 
however, some of the difficulties and limitations of the description 
presented here. 

One of the most important limitations has already been discussed. 
The interpretation of the formulas from the physical point of view 
is rather unsatisfactory. The interpretation in terms of the concept 
of transition probability requires our altering the mechanical system, 
and our speaking of states of the system at times very far from the 
present. The interpretation in terms of expectations, which avoids 
this difficulty, is incomplete, since the criterion that a functional 
represent a real physical observable is lacking. It is possible that an 
analysis of the theory of measurements is required here. A concept 
such as the “reduction of the wave packet” is not directly applicable, 
for in the mathematics we must describe the system for all times, 
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and if a measurement is going to be made in the interval of interest, 
this fact must be put somehow into the equations from the start. 
Summarizing: a physical interpretation should be sought which does 
not refer to the behaviour of the system at times very far distant 
from a present time of interest. 

A point of vagueness is the normalization factor, A. No rule 
has been given to determine it for a given action expression. This 
question is related to the difficult mathematical question as to the 
conditions under which the limiting process of subdividing the time 
scale, required by equations such as (68), actually converges. 

The problem of the form that relativistic quantum mechanics, and 
the Dirac equation, take from this point of view remains unsolved. 
Attempts to substitute, for the action, the classical relativistic form 
(integral of proper time) have met with difficulties associated with 
the fact that the square root involved becomes imaginary for certain 
values of the coordinates over which the action is integrated. 

The final test of any physical theory lies, of course, in experiment. 
No comparison to experiment has been made in the paper. The 
author hopes to apply these methods to quantum electrodynamics. 
It is only out of some such direct application that an experimental 
comparison can be made. 

The author would like to express his gratitude to Professor John 
A. Wheeler for his continued advice and encouragement. 
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SPACE-TIME APPROACH TO 
NON-RELATIVISTIC QUANTUM MECHANICS* 


R. P. FEYNMAN 
Cornell University, Ithaca, New York 


Abstract 


Non-relativistic quantum mechanics is formulated here in a differ- 
ent way. It is, however, mathematically equivalent to the familiar 
formulation. In quantum mechanics the probability of an event 
which can happen in several different ways is the absolute square 
of a sum of complex contributions, one from each alternative way. 
The probability that a particle will be found to have a path x(t) 
lying somewhere within a region of space-time is the square of a 
sum of contributions, one from each path in the region. The contri- 
bution from a single path is postulated to be an exponential whose 
(imaginary) phase is the classical action (in units of f) for the path 
in question. The total contribution from all paths reaching z, t 
from the past is the wave function 7)(a, t). This is shown to sat- 
isfy Schroedinger’s equation. The relation to matrix and operator 
algebra is discussed. Applications are indicated, in particular to 
eliminate the coordinates of the field oscillators from the equations 
of quantum electrodynamics. 


1. Introduction 


It is a curious historical fact that modern quantum mechanics began with 
two quite different mathematical formulations: the differential equation of 
Schroedinger, and the matrix algebra of Heisenberg. The two, apparently 
dissimilar approaches, were proved to be mathematically equivalent. These 
two points of view were destined to complement one another and to be 
ultimately synthesized in Dirac’s transformation theory. 

This paper will describe what is essentially a third formulation of non- 
relativistic quantum theory. This formulation was suggested by some of 


* This article was first published in Rev. Mod. Phys. 20, 367-387 (1948). 
Copyright © 1948 by the American Physical Society. Reprinted with permission. 
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Dirac’s!? remarks concerning the relation of classical action? to quantum 
mechanics. A probability amplitude is associated with an entire motion of 
a particle as a function of time, rather than simply with a position of the 
particle at a particular time. 

The formulation is mathematically equivalent to the more usual formu- 
lations. There are, therefore, no fundamentally new results. However, there 
is a pleasure in recognizing old things from a new point of view. Also, there 
are problems for which the new point of view offers a distinct advantage. 
For example, if two systems A and B interact, the coordinates of one of 
the systems, say B, may be eliminated from the equations describing the 
motion of A. The interaction with B is represented by a change in the 
formula for the probability amplitude associated with a motion of A. It is 
analogous to the classical situation in which the effect of B can be repre- 
sented by a change in the equations of motion of A (by the introduction of 
terms representing forces acting on A). In this way the coordinates of the 
transverse, as well as of the longitudinal field oscillators, may be eliminated 
from the equations of quantum electrodynamics. 

In addition, there is always the hope that the new point of view will 
inspire an idea for the modification of present theories, a modification nec- 
essary to encompass present experiments. 

We first discuss the general concept of the superposition of probability 
amplitudes in quantum mechanics. We then show how this concept can 
be directly extended to define a probability amplitude for any motion or 
path (position vs. time) in space-time. The ordinary quantum mechanics 
is shown to result from the postulate that this probability amplitude has a 
phase proportional to the action, computed classically, for this path. This is 
true when the action is the time integral of a quadratic function of velocity. 
The relation to matrix and operator algebra is discussed in a way that stays 
as close to the language of the new formulation as possible. There is no 
practical advantage to this, but the formulae are very suggestive if a gen- 


1 Pp. A.M. Dirac, The Principles of Quantum Mechanics (The Clarendon Press, Oxford, 
1935), second edition, Sec. 33; also, Physik. Zeits. Sowjetunion 3, 64 (1933). 

2 Pp. A.M. Dirac, Rev. Mod. Phys. 17, 195 (1945). 

3 Throughout this paper the term “action” will be used for the time integral of the 
Lagrangian along a path. When this path is the one actually taken by a particle, moving 
classically, the integral should more properly be called Hamilton’s first principle function. 
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eralization to a wider class of action functionals is contemplated. Finally, 
we discuss applications of the formulation. As a particular illustration, we 
show how the coordinates of a harmonic oscillator may be eliminated from 
the equations of motion of a system with which it interacts. This can be 
extended directly for application to quantum electrodynamics. A formal 
extension which includes the effects of spin and relativity is described. 


2. The Superposition of Probability Amplitudes 


The formulation to be presented contains as its essential idea the con- 
cept of a probability amplitude associated with a completely specified 
motion as a function of time. It is, therefore, worthwhile to review in 
detail the quantum-mechanical concept of the superposition of probability 
amplitudes. We shall examine the essential changes in physical outlook 
required by the transition from classical to quantum physics. 

For this purpose, consider an imaginary experiment in which we can 
make three measurements successive in time: first of a quantity A, then 
of B, and then of C. There is really no need for these to be of different 
quantities, and it will do just as well if the example of three successive 
position measurements is kept in mind. Suppose that a is one of a number 
of possible results which could come from measurement A, 0 is a result that 
could arise from B, and c is a result possible from the third measurement 
C.4 We shall assume that the measurements A, B, and C are the type of 
measurements that completely specify a state in the quantum-mechanical 
case. That is, for example, the state for which B has the value 6 is not 
degenerate. 

It is well known that quantum mechanics deals with probabilities, but 
naturally this is not the whole picture. In order to exhibit, even more 
clearly, the relationship between classical and quantum theory, we could 
suppose that classically we are also dealing with probabilities but that 
all probabilities either are zero or one. A better alternative is to imag- 
ine in the classical case that the probabilities are in the sense of classical 


4 For our discussion it is not important that certain values of a, b, or c might be 


excluded by quantum mechanics but not by classical mechanics. For simplicity, assume 
the values are the same for both but that the probability of certain values may be zero. 
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statistical mechanics (where, possibly, internal coordinates are not com- 
pletely specified). 

We define P,, as the probability that if measurement A gave the result a, 
then measurement B will give the result b. Similarly, P,. is the probability 
that if measurement B gives the result b, then measurement C' gives c. 
Further, let Pa be the chance that if A gives a, then C gives c. Finally, 
denote by Pate the probability of all three, i.e., if A gives a, then B gives 
b, and C' gives c. If the events between a and b are independent of those 
between b and c, then 


Pabe = PavPbe : (1) 


This is true according to quantum mechanics when the statement that B 
is b is a complete specification of the state. 
In any event, we expect the relation 


Pie Ye Page (2) 
b 


This is because, if initially measurement A gives a and the system is later 
found to give the result c to measurement C, the quantity B must have 
had some value at the time intermediate to A and C. The probability 
that it was bis Pay. We sum, or integrate, over all the mutually exclusive 
alternatives for b (symbolized by X,). 

Now, the essential difference between classical and quantum physics lies 
in Eq. (2). In classical mechanics it is always true. In quantum mechanics 
it is often false. We shall denote the quantum-mechanical probability that a 
measurement of C results in c when it follows a measurement of A giving a 
by P2,. Equation (2) is replaced in quantum mechanics by this remarkable 
law: There exist complex numbers (a5, Ybe; Pac Such that 


Pab = |~as|?, Pre = \Ybel*, and Pio= eae (3) 


5 We have assumed b is a non-degenerate state, and that therefore (1) is true. Pre- 
sumably, if in some generalization of quantum mechanics (1) were not true, even for 
pure states b, (2) could be expected to be replaced by: There are complex numbers 
Pabe = |Pabc|?. The analog of (5) is then ac = Yb Pabe- 
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The classical law, obtained by combining (1) and (2), 


Pig) days (4) 
b 


is replaced by 


Pac = ys PabPbc - (5) 


b 


If (5) is correct, ordinarily (4) is incorrect. The logical error made in 
deducing (4) consisted, of course, in assuming that to get from a to c the 
system had to go through a condition such that B had to have some definite 
value, b. 

If an attempt is made to verify this, i-e., if B is measured between the 
experiments A and C, then formula (4) is, in fact, correct. More precisely, 
if the apparatus to measure B is set up and used, but no attempt is made to 
utilize the results of the B measurement in the sense that only the A to C 
correlation is recorded and studied, then (4) is correct. This is because the 
B measuring machine has done its job; if we wish, we could read the meters 
at any time without disturbing the situation any further. The experiments 
which gave a and c can, therefore, be separated into groups depending on 
the value of b. 

Looking at probability from a frequency point of view (4) simply results 
from the statement that in each experiment giving a and c, B had some 
value. The only way (4) could be wrong is the statement, “B had some 
value,” must sometimes be meaningless. Noting that (5) replaces (4) only 
under the circumstance that we make no attempt to measure B, we are 
led to say that the statement, “B had some value,” may be meaningless 
whenever we make no attempt to measure B.® 

Hence, we have different results for the correlation of a and c, namely, 
Eq. (4) or Eq. (5), depending upon whether we do or do not attempt 
to measure B. No matter how subtly one tries, the attempt to mea- 
sure B must disturb the system, at least enough to change the results 


6 Jt does not help to point out that we could have measured B had we wished. The 


fact is that we did not. 
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from those given by (5) to those of (4).’ That measurements do, in fact, 
cause the necessary disturbances, and that, essentially, (4) could be false 
was first clearly enunciated by Heisenberg in his uncertainty principle. The 
law (5) is a result of the work of Schroedinger, the statistical interpretation 
of Born and Jordan, and the transformation theory of Dirac.® 

Equation (5) is a typical representation of the wave nature of matter. 
Here, the chance of finding a particle going from a to c through several 
different routes (values of b) may, if no attempt is made to determine the 
route, be represented as the square of a sum of several complex quantities — 
one for each available route. Probability can show the typical phenomena 
of interference, usually associated with waves, whose intensity is given by 
the square of the sum of contributions from different sources. The electron 
acts as a wave, (5), so to speak, as long as no attempt is made to verify 
that it is a particle; yet one can determine, if one wishes, by what route it 
travels just as though it were a particle; but when one does that, (4) applies 
and it does act like a particle. 

These things are, of course, well known. They have already been ex- 
plained many times.? However, it seems worthwhile to emphasize the fact 
that they are all simply direct consequences of Eq. (5), for it is essentially 
Eq. (5) that is fundamental in my formulation of quantum mechanics. 

The generalization of Eqs. (4) and (5) to a large number of measure- 
ments, say A, B, C, D,...,K, is, of course, that the probability of the 
sequence a, b, c, d,...,k is 


Piped k= |Cavodckl - 


7 How (4) actually results from (5) when measurements disturb the system has been 


studied particularly by J. von Neumann (Mathematische Grundlagen der Quanten- 
mechanik (Dover Publications, New York, 1943)). The effect of perturbation of the 
measuring equipment is effectively to change the phase of the interfering components, 
by 6, say, so that (5) becomes tac = pet YabYbc- However, as von Neumann shows, 
the phase shifts must remain unknown if B is measured so that the resulting probability 
Pac is the square of Wac averaged over all phases, 6). This results in (4). 

8 If A and B are the operators corresponding to measurements A and B, and if wa 
and wp are solutions of Awa = aya and Bxy = bx», then Yan = f xfPade = (xf, Va). 
Thus, Yap is an element (a|b) of the transformation matrix for the transformation from 
a representation in which A is diagonal to one in which B is diagonal. 

9 See, for example, W. Heisenberg, The Physical Principles of the Quantum Theory 
(University of Chicago Press, Chicago, 1930), particularly Chapter IV. 
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The probability of the result a, c, k, for example, if b, d,..., are measured, 
is the classical formula: 


Pack =o Phtack, (6) 
bod 


while the probability of the same sequence a, c, k if no measurements are 
made between A and C and between C and K is 


S- SS? ‘++ Dabed---k 


b od 


2 


(7) 


The quantity Yabca-... we can call the probability amplitude for the condi- 
tion A=a, B=b,C=c, D=d,...,K =k. (It is, of course, expressible 
as a product PabPbeVed «+ - Yjk-) 


3. The Probability Amplitude for a Space-Time Path 


The physical ideas of the last section may be readily extended to define 
a probability amplitude for a particular completely specified space-time 
path. To explain how this may be done, we shall limit ourselves to a 
one-dimensional problem, as the generalization to several dimensions is 
obvious. 

Assume that we have a particle which can take up various values of 
a coordinate x. Imagine that we make an enormous number of successive 
position measurements, let us say separated by a small time interval ¢. Then 
a succession of measurements such as A, B, C,..., might be the succession 
of measurements of the coordinate x at successive times f¢1, tg, t3,..., where 
ti41 = t; +e. Let the value, which might result from measurement of the 
coordinate at time t;, be x;. Thus, if A is a measurement of x at t; then 
x is what we previously denoted by a. From a classical point of view, the 
successive values, %1,%2,%3,..., of the coordinate practically define a path 
x(t). Eventually, we expect to go the limit ¢ — 0. 

The probability of such a path is a function of 71,2%2,...,2j,..., Say 
P(...%;,€i41,-..). The probability that the path lies in a particular region 
R of space-time is obtained classically by integrating P over that region. 
Thus, the probability that x; lies between a; and b;, and x;4, lies between 
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Ai+1 and bi44, etc., is 


bi pbiai 
of i ++) P(...%4,Ui41,-..) ++ dajdxizi-:: 
a, Jai4i 


= f Pls mitieny.) ee dedie (8) 
R 


the symbol f[ p meaning that the integration is to be taken over those ranges 
of the variables which lie within the region R. This is simply Eq. (6) with 
a,b,..., replaced by x1, %2,..., and integration replacing summation. 

In quantum mechanics this is the correct formula for the case that 
%1,%2,...,2%;,..., were actually all measured, and then only those paths 
lying within R were taken. We would expect the result to be different if no 
such detailed measurements had been performed. Suppose a measurement 
is made which is capable only of determining that the path lies somewhere 
within R. 

The measurement is to be what we might call an “ideal measurement.” 
We suppose that no further details could be obtained from the same mea- 
surement without further disturbance to the system. I have not been able 
to find a precise definition. We are trying to avoid the extra uncertainties 
that must be averaged over if, for example, more information were mea- 
sured but not utilized. We wish to use Eq. (5) or Eq. (7) for all x; and have 
no residual part to sum over in the manner of Eq. (4). 

We expect that the probability that the particle is found by our “ideal 
measurement” to be, indeed, in the region R is the square of a complex 
number |,(R)|?.. The number ¢(R), which we may call the probabil- 
ity amplitude for region R is given by Eq. (7) with a,b,..., replaced by 


Xi,2j41,---, and summation replaced by integration: 
e(R) = Lim O(--- aj, @ig1-++) ++ dada: (9) 
OUR 


The complex number ®(---2;,2;41---) is a function of the variables x; 
defining the path. Actually, we imagine that the time spacing ¢ approaches 
zero so that ® essentially depends on the entire path x(t) rather than only 
on just the values of x; at the particular times t;, x; = x(t;). We might call 
® the probability amplitude functional of paths x(t). 

We may summarize these ideas in our first postulate: 
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I. If an ideal measurement is performed to determine whether a particle 
has a path lying in a region of space-time, then the probability that the result 
will be affirmative is the absolute square of a sum of complex contributions, 
one from each path in the region. 

The statement of the postulate is incomplete. The meaning of a sum 
of terms one for “each” path is ambiguous. The precise meaning given in 
Eq. (9) is this: A path is first defined only by the positions x; through which 
it goes at a sequence of equally spaced times,!° t; = t;_1+e. Then all values 
of the coordinates within R have an equal weight. The actual magnitude 
of the weight depends upon ¢ and can be so chosen that the probability of 
an event which is certain shall be normalized to unity. It may not be best 
to do so, but we have left this weight factor in a proportionality constant 
in the second postulate. The limit « — 0 must be taken at the end of a 
calculation. 

When the system has several degrees of freedom the coordinate space x 
has several dimensions so that the symbol x will represent a set of coordi- 
nates (¢),a),...,a*)) for a system with k degrees of freedom. A path is 
a sequence of configurations for successive times and is described by giving 
the configuration x; or (2,2, ae al"), i.e., the value of each of the k 
coordinates for each time t;. The symbol dx; will be understood to mean 
the volume element in k-dimensional configuration space (at time t;). The 
statement of the postulates is independent of the coordinate system which 
is used. 

The postulate is limited to defining the results of position measurements. 
It does not say what must be done to define the result of a momentum 
measurement, for example. This is not a real limitation, however, because 
in principle the measurement of momentum of one particle can be performed 
in terms of position measurements of other particles, e.g., meter indicators. 
Thus, an analysis of such an experiment will determine what it is about 


the first particle which determines its momentum. 


10 There are very interesting mathematical problems involved in the attempt to avoid 
the subdivision and limiting processes. Some sort of complex measure is being associated 
with the space of functions x(t). Finite results can be obtained under unexpected circum- 
stances because the measure is not positive everywhere, but the contributions from most 
of the paths largely cancel out. These curious mathematical problems are sidestepped 
by the subdivision process. However, one feels as Cavalieri must have felt calculating 
the volume of a pyramid before the invention of calculus. 
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4. The Calculation of the Probability Amplitude for a 
Path 


The first postulate prescribes the type of mathematical framework required 
by quantum mechanics for the calculation of probabilities. The second 
postulate gives a particular content to this framework by prescribing how 
to compute the important quantity ® for each path: 

IT. The paths contribute equally in magnitude, but the phase of their 
contribution is the classical action (in units of h); t.e., the time integral of 
the Lagrangian taken along the path. 

That is to say, the contribution ®[x(t)] from a ie a Hy 2 is pro- 
portional to exp(i/h)S[x(t)], where the action S[x(t)] = [ L(& )) dt is 
the time integral of the classical Lagrangian L(<, . me oe . ee is 
question. The Lagrangian, which may be an explicit function of the time, 
is a function of position and velocity. If we suppose it to be a quadratic 
function of the velocities, we can show the mathematical equivalence of the 
postulates here and the more usual formulation of quantum mechanics. 

To interpret the first postulate it was necessary to define a path by 
giving only the succession of points x; through which the path passes at 
successive times t;. To compute S = [ L(«,x)dt we need to know the path 
at all points, not just at «;. We shall assume that the function x(t) in the 
interval between ¢; and t;+1 is the path followed by a classical particle, with 
the Lagrangian DL, which starting from x; at t; reaches 7,41 at tj41. This 
assumption is required to interpret the second postulate for discontinuous 
paths. The quantity ®(...2;,2;41,...) can be normalized (for various ¢) if 
desired, so that the probability of an event which is certain is normalized 
to unity as e« — 0. 

There is no difficulty in carrying out the action integral because of 
the sudden changes of velocity encountered at the times ¢t; as long as L 
does not depend upon any higher time derivatives of the position than the 
first. Furthermore, unless LD is restricted in this way the end points are not 
sufficient to define the classical path. Since the classical path is the one 


which makes the action a minimum, we can write 


S= DS) S(tin1, 0), (10) 
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where 


S(vi41, 01) = tin / “ L(«(t), x(t))dt. (11) 


Written in this way, the only appeal to classical mechanics is to supply us 
with a Lagrangian function. Indeed, one could consider postulate two as 
simply saying, “® is the exponential of 2 times the integral of a real function 
of x(t) and its first time derivative.” Then the classical equations of motion 
might be derived later as the limit for large dimensions. The function of « 
and « then could be shown to be the classical Lagrangian within a constant 
factor. 

Actually, the sum in (10), even for finite ¢, is infinite and hence mean- 
ingless (because of the infinite extent of time). This reflects a further in- 
completeness of the postulates. We shall have to restrict ourselves to a 
finite, but arbitrarily long, time interval. 

Combining the two postulates and using Eq. (10), we find 
a 


y(R) = Lim | exp F 


e>0 JR 


where we have let the normalization factor be split into a factor 1/A (whose 
exact value we shall presently determine) for each instant of time. The inte- 
gration is just over those values x;, %;+1,... which lie in the region R. This 
equation, the definition (11) of S(a:41,2;), and the physical interpretation 
of |y(R)|? as the probability that the particle will be found in R, complete 
our formulation of quantum mechanics. 


5. Definition of the Wave Function 


We now proceed to show the equivalence of these postulates to the ordi- 
nary formulation of quantum mechanics. This we do in two steps. We 
show in this section how the wave function may be defined from the new 
point of view. In the next section we shall show that this function satisfies 
Schroedinger’s differential wave equation. 

We shall see that it is the possibility, (10), of expressing S as a sum, and 
hence ® as a product, of contributions from successive sections of the path, 
which leads to the possibility of defining a quantity having the properties 
of a wave function. 
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To make this clear, let us imagine that we choose a particular time t 
and divide the region R in Eq. (12) into pieces, future and past relative 
to t. We imagine that R can be split into: (a) a region R’, restricted in 
any way in space, but lying entirely earlier in time than some t’, such that 
t' < t; (b) a region R” arbitrarily restricted in space but lying entirely 
later in time than t”, such that t” > t; (c) the region between t’ and ¢” in 
which all the values of x coordinates are unrestricted, i.e., all of space-time 
between t’ and t”. The region (c) is not absolutely necessary. It can be 
taken as narrow in time as desired. However, it is convenient in letting 
us consider varying ¢ a little without having to redefine R’ and R”. Then 
|p(R’, R")|? is the probability that the path occupies R’ and R”. Because 
R’ is entirely previous to R’, considering the time t as the present, we can 
express this as the probability that the path had been in region R’ and will 
be in region R”. If we divide by a factor, the probability that the path is 
in R’, to renormalize the probability we find: |y(R’, R”)|? is the (relative) 
probability that if the system were in region R’ it will be found later in R”. 

This is, of course, the important quantity in predicting the results of 
many experiments. We prepare the system in a certain way (e.g., it was in 
region R’) and then measure some other property (e.g., will it be found in 
region R’’?). What does (12) say about computing this quantity, or rather 
the quantity y(R’, R”) of which it is the square? 

Let us suppose in Eq. (12) that the time t corresponds to one particular 
point k of the subdivision of time into steps ¢, i.e., assume t = tz, the index 
k, of course, depending upon the subdivision «. Then, the exponential 
being the exponential of a sum may be split into a product of two factors 


. 0 . ko 

exp ; » S(xiqi, “i “exp i a, S(xiaa, “i : (13) 

The first factor contains only coordinates with index k or higher, while 
the second contains only coordinates with index k or lower. This split is 
possible because of Eq. (10), which results essentially from the fact that 
the Lagrangian is a function only of positions and velocities. First, the 
integration on all variables x; for 1 > k can be performed on the first factor 
resulting in a function of x; (times the second factor). Next, the integration 
on all variables x; for i < k can be performed on the second factor also, 
giving a function of x,. Finally, the integration on x, can be performed. 
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That is, yp(R’, R”) can be written as the integral over x, of the product of 
two factors. We will call these y*(#,,t) and (ax, t): 


y(R’, R”) = [ x2.)u@,Hae, (14) 
where 
eS dx dx 
Ge a k-1G2p-2 
V(ee,t) = Lim f exp > S(ix1,1)] -—G- > (15) 
and 
* _ TT: a oS : : 1 ALR dxrR+2 
x (vx, t) = Lim res FX Sen 7 cay eae ne (16) 


The symbol R’ is placed on the integral for w to indicate that the coor- 
dinates are integrated over the region R’, and, for t; between t’ and t, over 
all space. In like manner, the integral for y* is over R” and over all space 
for those coordinates corresponding to times between t and t”. The asterisk 
on x* denotes complex conjugate, as it will be found more convenient to 
define (16) as the complex conjugate of some quantity, y. 

The quantity ~ depends only upon the region R’ previous to t, and is 
completely defined if that region is known. It does not depend, in any way, 
upon what will be done to the system after time t. This latter information 
is contained in x. Thus, with ~ and y we have separated the past history 
from the future experiences of the system. This permits us to speak of the 
relation of past and future in the conventional manner. Thus, if a particle 
has been in a region of space-time R’ it may at time ¢ be said to be ina 
certain condition, or state, determined only by its past and described by the 
so-called were function w(x, t). This function contains all that is needed to 
predict future probabilities. For suppose, in another situation, the region 
R’ were different, say r’, and possibly the Lagrangian for times before t were 
also altered. But, neverthelss, suppose the quantity from Eq. (15) turned 
out to be the same. Then, according to (14) the probability of ending in 
any region R” is the same for R’ as for r’. Therefore, future measurements 
will not distinguish whether the system had occupied R’ or r’. Thus, the 
wave function 7)(2,t) is sufficient to define those attributes which are left 
from past history which determine future behavior. 
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Likewise, the function .*(z,t) characterizes the experience, or, let us 
say, experiment to which the system is to be subjected. If a different 
region, r”’ and different Lagrangian after t, were to give the same x*(z,t) 
via Eq. (16), as does region R”, then no matter what the preparation, ~, 
Eq. (14) says that the chance of finding the system in R” is always the 
same as finding it in r’”. The two “experiments” R” and r” are equivalent, 
as they yield the same results. We shall say loosely that these experiments 
are to determine with what probability the system is in state y. Actually, 
this terminology is poor. The system is really in state ~. The reason we 
can associate a state with an experiment is, of course, that for an ideal 
experiment there turns out to be a unique state (whose wave function is 
x(a, t)) for which the experiment succeeds with certainty. 

Thus, we can say: the probability that a system in state ~ will be 
found by an experiment whose characteristic state is y (or, more loosely, 
the chance that a system in state w will appear to be in y) is 


[x eve.nae ; ; (17) 


These results agree, of course, with the principles of ordinary quantum 
mechanics. They are a consequence of the fact that the Lagrangian is a 
function of position, velocity, and time only. 


6. The Wave Equation 


To complete the proof of the equivalence with the ordinary formulation 
we shall have to show that the wave function defined in the previous sec- 
tion by Eq. (15) actually satisfies the Schroedinger wave equation. Actu- 
ally, we shall only succeed in doing this when the Lagrangian L in (11) 
is a quadratic, but perhaps inhomogeneous, form in the velocities x(t). 
This is not a limitation, however, as it includes all the cases for which the 
Schroedinger equation has been verified by experiment. 

The wave equation describes the development of the wave function with 
time. We may expect to approach it by noting that, for finite e, Eq. (15) 
permits a simple recursive relation to be developed. Consider the appear- 
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ance of Eq. (15) if we were to compute w at the next instant of time: 


F k 
a 


W(Te41,¢ + €) =f exp i »; S(xi41, Zi) 


1=— 00 


WOT 1,5 
A A 


(15") 


This is similar to (15) except for the integration over the additional variable 
xz, and the extra term in the sum in the exponent. This term means that 
the integral of (15’) is the same as the integral of (15) except for the factor 
(1/A) exp(t/R)S(ax41, 0%). Since this does not contain any of the variables 
a; for 7 less than k, all of the integrations on dz; up to dxzz_1 can be 
performed with this factor left out. However, the result of these integrations 
is by (15) simply (ax, t). Hence, we find from (15’) the relation 


IP A= / es FS (eur 20)| Ui deTA: G8) 


This relation giving the development of w with time will be shown, for sim- 
ple examples, with suitable choice of A, to be equivalent to Schroedinger’s 
equation. Actually, Eq. (18) is not exact, but is only true in the limit « > 0 
and we shall derive the Schroedinger equation by assuming (18) is valid for 
first order in ¢. Equation (18) need only be true for small € to the first 
order in ¢. For if we consider the factors in (15) which carry us over a finite 
interval of time, T, the number of factors is T/e. If an error of order €? 
is made in each, the resulting error will not accumulate beyond the order 
€*(T/e) or T-, which vanishes in the limit. 

We shall illustrate the relation of (18) to Schroedinger’s equation by 
applying it to the simple case of a particle moving in one dimension in a 
potential V(x). Before we do this, however, we would like to discuss some 
approximations to the value $(7;+41,2;) given in (11) which will be sufficient 
for expression (18). 

The expression defined in (11) for S(aj41,2;) is difficult to cal- 
culate exactly for arbitrary ¢ from classical mechanics. Actually, it 
is only necessary that an approximate expression for S$(xj;41,27;) be 
used in (18), provided the error of the approximation be of an or- 
der smaller than the first in ¢. We limit ourselves to the case that 
the Lagrangian is a quadratic, but perhaps inhomogeneous, form in 
the velocities (t). As we shall see later, the paths which are important are 
those for which 7;4; — x; is of order e2. Under these circumstances, it is 
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sufficient to calculate the integral in (11) over the classical path taken by 
a free particle.!! 

In Cartesian coordinates'* the path of a free particle is a straight line so 
the integral of (11) can be taken along a straight line. Under these circum- 
stances it is sufficiently accurate to replace the integral by the trapezoidal 
rule 


E i41 — Ti E i41 — Vi 
S(wissa) = EL (S23 =" 20) +Sn (2. 0,) (19) 


or, if it proves more convenient, 


Li41— Lj Li41 + *) (20) 


S(%i41, 05) = eL | ——_., 

( i+1 i) ( ec 2 
These are not valid in a general coordinate system, e.g., spherical. An 
even simpler approximation may be used if, in addition, there is no vector 


potential or other terms linear in the velocity (see p. 89): 


S(xi41, 2) =eL (S28 =, ou) : (21) 


Thus, for the simple example of a particle of mass m moving in one 
dimension under a potential V (a), we can set 


2 
ME Vi — %y 
S(ai41,21) => (S4-*) — eV (ai41). (22) 


For this example, then, Eq. (18) becomes 


teptta= [ew Re (a=) - Vers} Dede A: 
(23) 


Let us call vg41 = © and @p41 — &e = € so that x, = x —€. Then (23) 


11 Tt is assumed that the “forces” enter through a scalar and vector potential and not 
in terms involving the square of the velocity. More generally, what is meant by a free 
particle is one for which the Lagrangian is altered by omission of the terms linear in, 
and those independent of, the velocities. 

12 More generally, coordinates for which the terms quadratic in the velocity in L(#, x) 
appear with constant coefficients. 
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becomes 


wa-EnS. (24) 


Y(a,tt+e) = [ex ee exp TW) 


The integral on € will converge if ¢(z, ¢) falls off sufficiently for large x 
(certainly if f *(x)y(x)dx = 1). In the integration of €, since € is very 
small, the exponential of im&?/2he oscillates extremely rapidly except in 
the region about € = 0 (€ of order (he/m)?). Since the function u(a — €,t) 
is a relatively smooth function of € (since « may be taken as small as de- 
sired), the region where the exponential oscillates rapidly will contribute 
very little because of the almost complete cancelation of positive and neg- 
ative contributions. Since only small € are effective, w(a — €,t) may be 
expanded as a Taylor series. Hence, 


w(az,t+ €) = exp (=) jew eS 


x fee = We + — - | d&/A. (25) 
Now 
/ . exp(im€? /2he)d& = (2thei/m)? , 
‘i : exp(im€? /2Qhe)Edé = 0, (26) 


/ _ exp(im€? /2he)€7°dé = (hei/m)(2rhei/m)? , 


—oco 


while the integral containing £3 is zero, for like the one with € it possesses 
an odd integrand, and the ones with €4 are of at least the order € smaller 
than the ones kept here.'? If we expand the left-hand side to first order 


13 Really, these integrals are oscillatory and not defined, but they may be defined by 
using a convergence factor. Such a factor is automatically provided by 7(x—e, t) in (24). 
If a more formal procedure is desired replace h by h(1 — id), for example, where 6 is a 
small positive number, and then let 6 > 0. 
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in €, (25) becomes 


O(x, t) —ieV(a)\ (2nhei/m)? 
w(a,t) + oT a = exp (=) a 
x [uct + ere | 2 fC) 


In order that both sides may agree to zero order in €, we must set 
A = (2nhei/m)? . (28) 
Then expanding the exponential containing V(x), we get 


hei x) . 


Canceling w(z,t) from both sides, and comparing terms to first order in € 
and multiplying by —A/2 one obtains 


ha 1 (hay? 
= (45) v+V(a)y, (30) 
which is Schroedinger’s equation for the problem in question. 

The equation for .~* can be developed in the same way, but adding a 
factor decreases the time by one step, i.e., x* satisfies an equation like (30) 
but with the sign of the time reversed. By taking complex conjugates we 
can conclude that y satisfies the same equation as y, i.e., an experiment 
can be defined by the particular state y to which it corresponds.'4 

This example shows that most of the contribution to (v.41, t-+e) comes 
from values of x, in w(x, t) which are quite close to 7,41 (distant of order 
€2) so that the integral equation (23) can, in the limit, be replaced by a 
differential equation. The “velocities,” («,+1 — 2,)/e which are important 
are very high, being of order (fi/ me)? which diverges as ¢ — 0. The paths 
involved are, therefore continuous but possess no derivative. They are of a 
type familiar from study of Brownian motion. 


14 Dr. Hartland Snyder has pointed out to me, in private conversation, the very inter- 
esting possibility that there may be a generalization of quantum mechanics in which the 
states measured by experiment cannot be prepared; that is, there would be no state into 
which a system may be put for which a particular experiment gives certainty for a result. 
The class of functions x is not identical to the class of available states ~. This would 
result if, for example, x satisfied a different equation than w. 
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It is these large velocities which make it so necessary to be careful in 
approximating $(2p41, 2%) from Eq. (11).1° To replace V(arp41) by V(x) 
would, of course, change the exponent in (18) by ie[V(a,) — V(an41)]/h 
which is of order ¢(x,41 — x%), and thus lead to unimportant terms of 
higher order than ¢ on the right-hand side of (29). It is for this reason that 
(20) and (21) are equally satisfactory approximations to S(#;41,2;) when 
there is no vector potential. A term, linear in velocity, however, arising from 
a vector potential, as Azdt must be handled more carefully. Here a term 
in S(xvp41,2~) such as A(ag41)(Ue41 — Ve) differs from A(x,) (E41 — Le) 
by a term of order (a%41 — xp)”, and, therefore, of order c. Such a term 
would lead to a change in the resulting wave equation. For this reason the 
approximation (21) is not a sufficiently accurate approximation to (11) and 
one like (20), (or (19) from which (20) differs by terms of order higher than 
€) must be used. If A represents the vector potential and p = (f/i)V, the 
momentum operator, then (20) gives, in the Hamiltonian operator, a term 
(1/2m)(p—(e/e)A)-(p—(e/c)A), while (21) gives (1/2m)(p-p—(2e/c)A- 
p + (e?/c?)A-A). These two expressions differ by (he/2imc)V - A which 
may not be zero. The question is still more important in the coefficient 
of terms which are quadratic in the velocities. In these terms (19) and 
(20) are not sufficiently accurate representations of (11) in general. It is 
when the coefficients are constant that (19) or (20) can be substituted for 
(11). If an expression such as (19) is used, say for spherical coordinates, 
when it is not a valid approximation to (11), one obtains a Schroedinger 
equation in which the Hamiltonian operator has some of the momentum 
operators and coordinates in the wrong order. Equation (11) then resolves 
the ambiguity in the usual rule to replace p and q by the non-commuting 
quantities (4/i)(0/0q) and q in the classical Hamiltonian H(p, q). 

It is clear that the statement (11) is independent of the coordinate 
system. Therefore, to find the differential wave equation it gives in any 


15 Equation (18) is actually exact when (11) is used for S(xi41,2;) for arbitrary € for 
cases in which the potential does not involve x to higher powers than the second (e.g., 
free particle, harmonic oscillator). It is necessary, however, to use a more accurate value 
of A. One can define A in this way. Assume classical particles with k degrees of freedom 
start from the point x;, t; with uniform density in momentum space. Write the number of 
particles having a given component of momentum in range dp as dp/po with po constant. 
Then A = (2nhi/po)*/2 9-2, where p is the density in k-dimensional coordinate space 
x;4+41 of these particles at time tj+1. 
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coordinate system, the easiest procedure is first to find the equations in 
Cartesian coordinates and then to transform the coordinate system to the 
one desired. It suffices, therefore, to show the relation of the postulates and 
Schroedinger’s equation in rectangular coordinates. 

The derivation given here for one dimension can be extended directly 
to the case of three-dimensional Cartesian coordinates for any number, 
kK, of particles interacting through potentials with one another, and in 
a magnetic field, described by a vector potential. The terms in the vec- 
tor potential require completing the square in the exponent in the usual 
way for Gaussian integrals. The variable x must be replaced by the set 


3K) where a), 7), x) are the coordinates of the first particle 


c to af 
of mass m1, x), «©, © of the second of mass m2, etc. The symbol 
dx is replaced by da dx®) ...da@®), and the integration over dx is re- 
placed by a 3K-fold integral. The constant A has, in this case, the value 
A = (2nhei/m,)?(2thei/m2)? ---(Qrhei/mx)?. The Lagrangian is the 
classical Lagrangian for the same problem, and the Schroedinger equation 
resulting will be that which corresponds to the classical Hamiltonian, de- 
rived from this Lagrangian. The equations in any other coordinate system 
may be obtained by transformation. Since this includes all cases for which 
Schroedinger’s equation has been checked with experiment, we may say our 
postulates are able to describe what can be described by non-relativistic 
quantum mechanics, neglecting spin. 


7. Discussion of the Wave Equation: The Classical 
Limit 


This completes the demonstration of the equivalence of the new and old 
formulations. We should like to include in this section a few remarks about 
the important equation (18). 

This equation gives the development of the wave function during a small 
time interval. It is easily interpreted physically as the expression of Huy- 
gens’ principle for matter waves. In geometrical optics the rays in an in- 
homogeneous medium satisfy Fermat’s principle of least time. We may 
state Huygens’ principle in wave optics in this way: If the amplitude of the 
wave is known on a given surface, the amplitude at a near by point can be 
considered as a sum of contributions from all points of the surface. Each 
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contribution is delayed in phase by an amount proportional to the time 
it would take the light to get from the surface to the point along the ray 
of least time of geometrical optics. We can consider (22) in an analogous 
manner starting with Hamilton’s first principle of least action for classical 
or “geometrical” mechanics. If the amplitude of the wave w is known on a 
given “surface,” in particular the “surface” consisting of all x at time t, its 
value at a particular nearby point at time t+ ¢, is a sum of contributions 
from all points of the surface at t. Each contribution is delayed in phase 
by an amount proportional to the action it would require to get from the 
surface to the point along the path of least action of classical mechanics. !® 

Actually Huygens’ principle is not correct in optics. It is replaced by 
Kirchoff’s modification which requires that both the amplitude and its 
derivative must be known on the adjacent surface. This is a consequence of 
the fact that the wave equation in optics is second order in the time. The 
wave equation of quantum mechanics is first order in the time; therefore, 
Huygens’ principle is correct for matter waves, action replacing time. 

The equation can also be compared mathematically to quantities ap- 
pearing in the usual formulations. In Schroedinger’s method the develop- 
ment of the wave function with time is given by 


-74, =Hy, (31) 
which has the solution (for any ¢ if H is time independent) 
w(a,t +e) = exp(—icH/h)W(a,t). (32) 


Therefore, Eq. (18) expresses the operator exp(—i¢Hh) by an approximate 
integral operator for small e. 

From the point of view of Heisenberg one considers the position at time 
t, for example, as an operator x. The position x’ at a later time t+ € can 
be expressed in terms of that at time t by the operator equation 


x’ = exp(i¢H/h)x exp —(icH/h) . (33) 


The transformation theory of Dirac allows us to consider the wave function 
at time t+ ¢, U(a’,t + €), as representing a state in a representation in 


16 See in this connection the very interesting remarks of Schroedinger, Ann. d. Physik 
79, 489 (1926). 
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which x’ is diagonal, while u(a,t) represents the same state in a repre- 
sentation in which x is diagonal. They are, therefore, related through the 
transformation function (2’|x)- which relates these representations: 


Wa',t+e)= [Oo eile. tae. 
Therefore, the content of Eq. (18) is to show that for small ¢ we can set 
(2'|x)2 = (1/A) exp(iS(2", x)/h) (34) 


with S(x’,x) defined as in (11). 

The close analogy between (a’|”)- and the quantity exp(iS(2’, x) /h) 
has been pointed out on several occasions by Dirac.! In fact, we now see 
that to sufficient approximations the two quantities may be taken to be 
proportional to each other. Dirac’s remarks were the starting point of the 
present development. The points he makes concerning the passage to the 
classical limit h — 0 are very beautiful, and I may perhaps the excused for 
briefly reviewing them here. 

First we note that the wave function at 2” at time ¢” can be obtained 
from that at x’ at time t’ by 


" ce ai p25 * gi adits i Bhs 
We se) sue exp |= oe (ri41, 21) CE ray ga “=. 


where we put 2p = 2’ and 2; = 2” where je = t” —t’ (between the times t’ 
and t” we assume no restriction is being put on the region of integration). 
This can be seen either by repeated applications of (18) or directly from 
Eq. (15). Now we ask, as h — 0 what values of the intermediate coordinates 
x; contribute most strongly to the integral? These will be the values most 
likely to be found by experiment and therefore will determine, in the limit, 
the classical path. If A is very small, the exponent will be a very rapidly 
varying function of any of its variables x;. As x; varies, the positive and 
negative contributions of the exponent nearly cancel. The region at which 
x; contributes most strongly is that at which the phase of the exponent 
varies least rapidly with x; (method of stationary phase). Call the sum in 
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the exponent S; 


S=) S(xi41, 21). (36) 
i=0 
Then the classical orbit passes, approximately, through those points x; at 
which the rate of change of S with x; is small, or in the limit of small h, 
zero, i.e., the classical orbit passes through the points at which 05/0x; = 0 
for all x;. Taking the limit ¢ — 0, (36) becomes in view of (11) 


S= ij. L(a(t), x(t))dt. (37) 


We see then that the classical path is that for which the integral (37) suffers 
no first-order change on varying the path. This is Hamilton’s principle and 
leads directly to the Lagrangian equations of motion. 


8. Operator Algebra: Matrix Elements 


Given the wave function and Schroedinger’s equation, of course all of the 
machinery of operator or matrix algebra can be developed. It is, however, 
rather interesting to express these concepts in a somewhat different lan- 
guage more closely related to that used in stating the postulates. Little 
will be gained by this in elucidating operator algebra. In fact, the results 
are simply a translation of simple operator equations into a somewhat more 
cumbersome notation. On the other hand, the new notation and point of 
view are very useful in certain applications described in the introduction. 
Furthermore, the form of the equations permits natural extension to a wider 
class of operators than is usually considered (e.g., ones involving quantities 
referring to two or more different times). If any generalization to a wider 
class of action functionals is possible, the formulae to be developed will play 
an important role. 

We discuss these points in the next three sections. This section is 
concerned mainly with definitions. We shall define a quantity which we 
call a transition element between two states. It is essentially a matrix 
element. But instead of being the matrix element between a state q and 
another y corresponding to the same time, these two states will refer to 
different times. In the following section a fundamental relation between 
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transition elements will be developed from which the usual commutation 
rules between coordinate and momentum may be deduced. The same re- 
lation also yields Newton’s equation of motion in matrix form. Finally, in 
Sec. 10 we discuss the relation of the Hamiltonian to the operation of dis- 
placement in time. 

We begin by defining a transition element in terms of the probability 
of transition from one state to another. More precisely, suppose we have a 
situation similar to that described in deriving (17). The region R consists 
of a region R’ previous to t’, all space between ¢’ and t¢” and the region R” 


after t’. We shall study the probability that a system in region R’ is later 
found in region R”. This is given by (17). We shall discuss in this section 
how it changes with changes in the form of the Lagrangian between t’ and 
t”. In Sec. 10 we discuss how it changes with changes in the preparation 
R’ or the experiment R”. 

The state at time ¢’ is defined completely by the preparation R’. It can 
be specified by a wave function ~(2’,t’) obtained as in (15), but containing 
only integrals up to the time t’. Likewise, the state characteristic of the 
experiment (region R”) can be defined by a function y(x”,t”) obtained 
from (16) with integrals only beyond t’’. The wave function (a”,t”) at 
time t” can, of course, also be gotten by appropriate use of (15). It can also 
be gotten from ~(2’,t’) by (35). According to (17) with t” used instead of 
t, the probability of being found in y if prepared in w is the square of what 
we shall call the transition amplitude [ y*(2",t”)y(a",t”)dx"”. We wish to 
express this is terms of y at t” and 7 at t’. This we can do with the aid 
of (35). Thus, the chance that a system prepared in state v4 at time t’ 
will be found after ¢” to be in a state y,” is the square of the transition 
amplitude 


(xer|tde)s = Lim fs [xe (e", 09) exp(is/ nota) 2 a, 


where we have used the abbreviation (36). 

In the language of ordinary quantum mechanics if the Hamiltonian, H, 
is constant, (2, t’”) = exp[—i(t” —t/)H/h]w(z, t’) so that (38) is the matrix 
element of exp[—i(t” — t’)H/h] between states x and yy. 

If F is any function of the coordinates x; for t! < t; < t’’, we shall define 


Space-Time Approach to Non-Relativistic Quantum Mechanics 95 


the transition element of F' between the states w at t’ and y at t” for the 
action S as (a@” = 2;,x' = x0): 


inte = Big f= fe APU 8) 
oc JN 


a 
-exp i 5 Li41, Li) 


In the limit ¢ — 0, F is a functional of the path x(t). 


We shall see presently why such quantities are important. It will be 
easier to understand if we stop for a moment to find out what the quantities 
correspond to in conventional notation. Suppose F’ is simply x, where k 
corresponds to some time t = t,. Then on the right-hand side of (39) 
the integrals from 29 to x,-1 may be performed to produce W(ax,t) or 
exp[—i(t — t’)H/f]yy. In like manner the integrals on x; for 7 > 71> k give 
x" (ap, t) or {exp[—7(t” — t)H/hilyy}*. Thus, the transition element of xx, 


(ve |Fl ve) s = [ee EOD pCO pecs 


= iis (a, thaw(a, t)dx (40) 


is the matrix element of x at time t = tz between the state which would 
develop at time t from 7 at t’ and the state which will develop from time 
t to ye at t”. It is, therefore, the matrix element of x(t) between these 
states. 

Likewise, according to (39) with F = x41, the transition element of 
Le41 is the matrix element of x(t+¢). The transition element of F = 
(te41—Xx)/€ is the matrix element of (x(t+¢)—x(t)) /e or of i(/Hx—xH)/h, 
as is easily shown from (40). We can call this the matrix element of velocity 
x(t). 

Suppose we consider a second problem which differs from the first be- 
cause, for example, the potential is augmented by a small amount U(, xt). 
Then in the new problem the quantity replacing S is S$’ = $+ dj;eU(a;,t;). 
Substitution into (38) leads directly to 


ae OF 
exp = > U(ai,ts) w) : (41) 
s 


i=l 


(xe |L der) sr = (w 
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Thus, transition elements such as (39) are important insofar as F may 
arise in some way from a change 0S in an action expression. We denote, 
by observable functionals, those functionals F’ which can be defined, (pos- 
sibly indirectly) in terms of the changes which are produced by possible 
changes in the action S. The condition that a functional be observable 
is somewhat similar to the condition that an operator be Hermitian. The 
observable functionals are a restricted class because the action must remain 
a quadratic function of velocities. From one observable functional others 
may be derived, for example, by 


J 
vE 
(xen |F We) 30 — (w Fexp mee (a;,t 


a]ee) (42) 
s 
which is obtained from (39). 

Incidentally, (41) leads directly to an important perturbation formula. 
If the effect of U is small the exponential can be expanded to first order in 


U and we find 
dU vi, ti |e) . (43) 


Of particular importance is the case that x, is a state in which yy would 
not be found at all were it not for the disturbance, U (i.e., (x4|1|Y)s = 0). 


Then 
af Xt" a a (aj, af 
S 


is the probability of transition as induced to first order by the perturbation. 


(xe [Leber )sr = (xer Ler) s + = ae Xe 


(44) 


In ordinary notation, 


( Xe Su Dist; |) = ff fatervmneou. —(i/h)H(t—-t war) a 
S 


so that (44) reduces to the usual expression!” for time dependent 


perturbations. 


17 P. A.M. Dirac, The Principles of Quantum Mechanics (The Clarendon Press, Oxford, 
1935), second edition, Sec. 47, Eq. (20). 
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9. Newton’s Equations: The Commutation Relation 


In this section we find that different functionals may give identical results 
when taken between any two states. This equivalence between functionals 
is the statement of operator equations in the new language. 

If F depends on the various coordinates, we can, of course, define a new 
functional OF /Ox,, by differentiating it with respect to one of its variables, 
say 14 (0 < k < J). If we calculate (xi |OF /Oxx|W) 9 by (39) the integral 
on the right-hand side will contain 0F'/0x;. The only other place that the 
variable x, appears is in S. Thus, the integration on x, can be performed 
by parts. The integrated part vanishes (assuming wave functions vanish 
at infinity) and we are left with the quantity —F(0/0x,) exp(iS/h) in the 
integral. However, (0/0zr,) exp(tS/h) = (i/h)(OS/0zx,) exp(iS/h), so the 
right side represents the transition element of —(i/h)F(0S/0xx), ie., 


de ee 
Xt" or t! a hh Xt" 


This very important relation shows that two different functionals may give 


Os 
) je 
OLK 


we) (45) 


the same result for the transition element between any two states. We say 
they are equivalent and symbolize the relation by 


h OF Os 
ae ee ona 49) 


the symbol ba emphasizing the fact that functionals equivalent under one 
action may not be equivalent under another. The quantities in (46) need 
not be observable. The equivalence is, nevertheless, true. Making use of 
(36) one can write 
Bee OS(@p41, Uk) us OS (xp, R41) (47) 
1 Ox, S OxK OxK 

This equation is true to zero and first order in ¢ and has as consequences 
the commutation relations of momentum and coordinate, as well as the 
Newtonian equations of motion in matrix form. 

In the case of our simple one-dimensional problem, $(a;41,2;) is given 
by the expression (15), so that 


OS(@p41, Uk) /OXE = —M(LE41 — Le)/E, 
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and 

OS(@p, p41) /Ox~ = +M(axE — TR-1)/e — EV" (ae); 
where we write V’(a) for the derivative of the potential, or force. Then 
(47) becomes 


h OF Le+1— Le Lk - XLK-1 i 
toe gk |-m (Aaa oes) —€V"(zx)| - (48) 


If F does not depend on the variable x,, this gives Newton’s equations of 
motion. For example, if F' is constant, say unity, (48) just gives (dividing 
by €) 


0a ae (4 ied — i ia ae = 1) _ V' (ar) ‘ 
€ € 
Thus, the transition element of mass times acceleration [(a,41 — tp) /é — 
(a~ — &p-1)/e]/e between any two states is equal to the transition element 
of force —V'(x,) between the same states. This is the matrix expression of 
Newton’s law which holds in quantum mechanics. 

What happens if F' does depend upon z;? For example, let F = 2x. 
Then (48) gives, since OF /Ox; = 1, 


A - — Lp 
—- +O rE [-m (a Bil — at Maa 5 :) = ev'(as) 
4 Ss eS E 


or, neglecting terms of order e, 


—_ — UE h 
m (SH) rym (SE) eee (49) 


In order to transfer an equation such as (49) into conventional notation, 
we shall have to discover what matrix corresponds to a quantity such as 
Cpee41- It is clear from a study of (39) that if F is set equal to, say, 
f(ax)g(%x41), the corresponding operator in (40) is 


er) ge OEE Fe Ran 2 


the matrix element being taken between the states yz and wy. The op- 
erators corresponding to functions of 2,41 will appear to the left of the 
operators corresponding to functions of xz, i-e., the order of terms in a ma- 
triz operator product corresponds to an order in time of the corresponding 


factors in a functional. Thus, if the functional can and is written in such a 
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way that in each term factors corresponding to later times appear to the left 
of factors corresponding to earlier terms, the corresponding operator can 
immediately be written down if the order of the operators is kept the same 
as in the functional.'® Obviously, the order of factors in a functional is of 
no consequence. The ordering just facilitates translation into conventional 
operator notation. To write Eq. (49) in the way desired for easy translation 
would require the factors in the second term on the left to be reversed in 
order. We see, therefore, that it corresponds to 


px —xp=h/i 


where we have written p for the operator mx. 

The relation between functionals and the corresponding operators is 
defined above in terms of the order of the factors in time. It should be re- 
marked that this rule must be especially carefully adhered to when quanti- 
ties involving velocities or higher derivatives are involved. The correct func- 
tional to represent the operator (#)? is actually (p41 —2x)/€-(@~ —p_-1)/€ 
rather than [(a,41 — vx)/e]?. The latter quantity diverges as 1/e as 
e — 0. This may be seen by replacing the second term in (49) by its 
value tp41° (p41 — LE)/e calculated an instant ¢ later in time. This 
does not change the equation to zero order in ¢. We then obtain (dividing 
by ¢) 


(sat) (50) 


E S ~wme 


This gives the result expressed earlier that the root mean square of the 
“velocity” (p41 — @,)/e between two successive positions of the path is of 
order €~?. 

It will not do then to write the functional for kinetic energy, say, 
simply as 


snl(we — 24) /eP (51) 


for this quantity is infinite as « — 0. In fact, it is not an observable 
functional. 


18 Dirac has also studied operators containing quantities referring to different times. 
See reference 2. 
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One can obtain the kinetic energy as an observable functional by consid- 
ering the first-order change in transition amplitude occasioned by a change 
in the mass of the particle. Let m be changed to m(1 + 6) for a short 
time, say ¢, around t;. The change in the action is $dem[(xz41 — rx)/€]” 
the derivative of which gives an expression like (51). But the change in m 
changes the normalization constant 1/A corresponding to dx, as well as the 
action. The constant is changed from (2ahei/m)~? to (2rhei/m(1+6))72 
or by 15(2nhei/m)~? to first order in 6. The total effect of the change in 
mass in Eq. (38) to the first order in 6 is 


(xe 


We expect the change of order 6 lasting for a time ¢ to be of order de. 


sécimn| (a1 = te) fe]? /h+ 5 vv) A 


Hence, dividing by dci/h, we can define the kinetic energy functional as 


K.E. = sin[(r+s — ap) /e]? + K/2ei. (52) 


This is finite as « — 0 in view of (50). By making use of an equation which 
results from substituting m(a,41 — x,)/e for F in (48) we can also show 
that the expression (52) is equal (to order ¢) to 


ise (4-1) pea . (53) 
2 € € 


That is, the easiest way to produce observable functionals involving powers 
of the velocities is to replace these powers by a product of velocities, each 
factor of which is taken at a slightly different time. 


10. The Hamiltonian: Momentum 


The Hamiltonian operator is of central importance in the usual formula- 
tion of quantum mechanics. We shall study in this section the functional 
corresponding to this operator. We could immediately define the Hamil- 
tonian functional by adding the kinetic energy functional (52) or (53) to 
the potential energy. This method is artificial and does not exhibit the 
important relationship of the Hamiltonian to time. We shall define the 
Hamiltonian functional by the changes made in a state when it is displaced 


in time. 
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To do this we shall have to digress a moment to point out that the 
subdivision of time into equal intervals is not necessary. Clearly, any subdi- 
vision into instants t; will be satisfactory; the limits are to be taken as the 
largest spacing, t;41 —t;, approaches zero. The total action S must now be 
represented as a sum 


S= S- S (i115 tina; i, ts) ; (54) 
where 


titi 

Seiten) = [Dl e()de, (55) 
ti 

the integral being taken along the classical path between x; at t; and x4, at 

ti41. For the simple one-dimensional example this becomes, with sufficient 

accuracy, 


2, 
Sat teh) = (3 oa) = view} (ti41 —ti); (56) 


tiga — bj 


the corresponding normalization constant for integration on dx; is A = 
(Qrhi(tiy1 —t;)/m)~2. 

The relation of H to the change in a state with displacement in time 
can now be studied. Consider a state 7(t) defined by a space-time region 
R’. Now imagine that we consider another state at time t, s(t), defined 
by another region Rs. Suppose the region Rj is exactly the same as R’ 
except that it is earlier by a time 4, i.e., displaced bodily toward the past 
by a time 6. All the apparatus to prepare the system for Rj, is identical 
to that for R’ but is operated a time 6 sooner. If ZL depends explicitly 
on time, it, too, is to be displaced, i.e., the state ws is obtained from 
the L used for state q except that the time t in Ls is replaced by t+ 0. 
We ask how does the state ws differe from ~? In any measurement the 
chance of finding the system in a fixed region R” is different for R’ and 
R’,. Consider the change in the transition element (x|1|Ws)5, produced by 
the shift 6. We can consider this shift as effected by decreasing all values 
of t; by 6 for i < k and leaving all t; fixed for i > k, where the time t 
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lies in the interval between t,41 and t,.!® This change will have no effect 
on $(xi41,ti41; Vi, ti) as defined by (55) as long as both t,,, and ¢; are 
changed by the same amount. On the other hand, S(ax41,th41; Uk, tk) is 
changed to S(xg41,th+1; Uk, tk — 6). The constant 1/A for the integration 
on dz, is also altered to (2rhi(tpy1 — te + 6)/m)~2. The effect of these 
changes on the transition element is given to the first order in 6 by 


10 
(xlllv)s — (xl1¥s) 55 = 5 (xlHaly)s 5 (57) 
here the Hamiltonian functional Hy, is defined by 
OS (@p41,tk+13 Lk, te) h 
A, = oe - +-— ——_. 58 
0 otk Hise) 78) 


The last term is due to the change in 1/A and serves to keep H;, finite as 
€ + 0. For example, for the expression (56) this becomes 


2 
m ( Lpr1 — Xk h 
iS ae aes (eae eal) a ey 

ae (Gu) Le Cre ae (te+1), 


which is just the sum of the kinetic energy functional (52) and that of the 
potential energy V (ax41). 

The wave function w5(x,t) represents, of course, the same state as 
w(a,t) will be after time 6, ie., U(a,t +6). Hence, (57) is intimately 
related to the operator equation (31). 

One could also consider changes occasioned by a time shift in the final 
state vy. Of course, nothing new results in this way for it is only the relative 
shift of x and ~ which counts. One obtains an alternative expression 

Hy = —25@@et tert tert) (59) 
Otk+1 2i(thsa — tk) 
This differs from (58) only by terms of order ¢. 

The time rate of change of a functional can be computed by considering 

the effect of shifting both initial and final state together. This has the same 


19 From the point of view of mathematical rigor, if 5 is finite, as e — O one gets 
into difficulty in that, for example, the interval t,41 — t, is kept finite. This can be 
straightened out by assuming 6 to vary with time and to be turned on smoothly before 
t = t, and turned off smoothly after t = t,. Then keeping the time variation of 6 fixed, 
let e — 0. Then seek the first-order change as 6 — 0. The result is essentially the same 
as that of the crude procedure used above. 
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effect as calculating the transition element of the functional referring to a 
later time. What results is the analog of the operator equation 


he 

—f = Hf — fH. 

Z 

The momentum functional p, can be defined in an analagous way by 
considering the changes made by displacements of position: 


iA 
(xlllb)s — xlllva)sa =  (xipalv)s - 


The state wa is prepared from a region R, which is identical to region 
R’ except that it is moved a distance A in space. (The Lagrangian, if it 
depends explicitly on x, must be altered to La = L(&,a — A) for times 
previous to t.) One finds”? 


OS(@K41, Lk) OS(@K41;, Lk) 
acre ae ae 
Since wa(za,t) is equal to y(a — A,t), the close connection between pz and 
the x-derivative of the wave function is established. 

Angular momentum operators are related in an analogous way to 
rotations. 

The derivative with respect to ti41 of S(aji41,ti41;Ui,t;) appears in 
the definition of H;. The derivative with respect to x;41 defines p;. But 
the derivative with respect to tj41 of S(xi41,ti41; Xi, ti) is related to the 
derivative with respect to 2;41, for the function $(a;41, ti+1; vi, t;) defined 
by (55) satisfies the Hamilton—Jacobi equation. Thus, the Hamilton—Jacobi 
equation is an equation expressing H; in terms of the p;. In other words, 
it expresses the fact that time displacements of states are related to space 
displacements of the same states. This idea leads directly to a derivation of 
the Schroedinger equation which is far more elegant than the one exhibited 
in deriving Eq. (30). 


20 We did not immediately substitute p; from (60) into (47) because (47) would then no 
longer have been valid to both zero order and the first order in ¢. We could derive the 
commutation relations, but not the equations of motion. The two expressions in (60) 
represent the momenta at each end of the interval t; to t;41. They differ by eV’(rz41) 
because of the force acting during the time e. 
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11. Inadequacies of the Formulation 


The formulation given here suffers from a serious drawback. The mathe- 
matical concepts needed are new. At present, it requires an unnatural and 
cumbersome subdivision of the time interval to make the meaning of the 
equations clear. Considerable improvement can be made through the use 
of the notation and concepts of the mathematics of functionals. However, 
it was thought best to avoid this in a first presentation. One needs, in 
addition, an appropriate measure for the space of the argument functions 
x(t) of the functionals.1° 

It is also incomplete from the physical standpoint. One of the most 
important characteristics of quantum mechanics is its invariance under uni- 
tary transformations. These correspond to the canonical transformations of 
classical mechanics. Of course, the present formulation, being equivalent to 
ordinary formulations, can be mathematically demonstrated to be invariant 
under these transformations. However, it has not been formulated in such 
a way that it is physically obvious that it is invariant. This incompleteness 
shows itself in a definite way. No direct procedure has been outlined to 
describe measurements of quantities other than position. Measurements of 
momentum, for example, of one particle, can be defined in terms of mea- 
surements of positions of other particles. The result of the analysis of such 
a situation does show the connection of momentum measurements to the 
Fourier transform of the wave function. But this is a rather roundabout 
method to obtain such an important physical result. It is to be expected 
that the postulates can be generalized by the replacement of the idea of 
“paths in a region of space-time R” to “paths of class R,” or “paths having 
property R.” But which properties correspond to which physical measure- 
ments has not been formulated in a general way. 


12. A Possible Generalization 


The formulation suggests an obvious generalization. There are interesting 
classical problems which satisfy a principle of least action but for which 
the action cannot be written as an integral of a function of positions and 
velocities. The action may involve accelerations, for example. Or, again, if 
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interactions are not instantaneous, it may involve the product of coordinates 
at two different times, such as f x(t)a(t+T)dt. The action, then, cannot be 
broken up into a sum of small contributions as in (10). As a consequence, 
no wave function is available to describe a state. Nevertheless, a transition 
probability can be defined for getting from a region R’ into another R”. 
Most of the theory of the transition elements (yz”|F'|q4-)g3 can be carried 
over. One simply invents a symbol, such as (R”|F'|R’) 3 by an equation such 
as (39) but with the expressions (19) and (20) for 7) and y substituted, and 
the more general action substituted for S. Hamiltonian and momentum 
functionals can be defined as in Sec. 10. Further details may be found in a 
thesis by the author.?! 


13. Application to Eliminate Field Oscillators 


One characteristic of the present formulation is that it can give one a sort of 
bird’s-eye view of the space-time relationships in a given situation. Before 
the integrations to the x; are performed in an expression such as (39) one 
has a sort of format into which various F' functionals may be inserted. One 
can study how what goes on in the quantum-mechanical system at different 
times is interrelated. To make these vague remarks somewhat more definite, 
we discuss an example. 

In classical electrodynamics the fields describing, for instance, the in- 
teraction of two particles can be represented as a set of oscillators. The 
equations of motion of these oscillators may be solved and the oscillators 
essentially eliminated (Lienard and Wiechert potentials). The interactions 
which result involve relationships of the motion of one particle at one time, 
and of the other particle at another time. In quantum electrodynamics 
the field is again represented as a set of oscillators. But the motion of the 


21 The theory of electromagnetism described by J. A. Wheeler and R. P. Feynman, Rev. 
Mod. Phys. 17, 157 (1945) can be expressed in a principle of least action involving the 
coordinates of particles alone. It was an attempt to quantize this theory, without refer- 
ence to the fields, which led the author to study the formulation of quantum mechanics 
given here. The extension of the ideas to cover the case of more general action functions 
was developed in his Ph.D. thesis, “The principle of least action in quantum mechanics” 
submitted to Princeton University, 1942. 
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oscillators cannot be worked out and the oscillators eliminated. It is true 
that the oscillators representing longitudinal waves may be eliminated. The 
result is instantaneous electrostatic interaction. The electrostatic elimina- 
tion is very instructive as it shows up the difficulty of self-interaction very 
distinctly. In fact, it shows it up so clearly that there is no ambiguity in 
deciding what term is incorrect and should be omitted. This entire process 
is not relativistically invariant, nor is the omitted term. It would seem 
to be very desirable if the oscillators, representing transverse waves, could 
also be eliminated. This presents an almost insurmountable problem in the 
conventional quantum mechanics. We expect that the motion of a particle 
a at one time depends upon the motion of 6 at a previous time, and vice 
versa. A wave function W(2a, Xv; t), however, can only describe the behav- 
ior of both particles at one time. There is no way to keep track of what b 
did in the past in order to determine the behavior of a. The only way is to 
specify the state of the set of oscillators at t, which serve to “remember” 
what b (and a) had been doing. 

The present formulation permits the solution of the motion of all the 
oscillators and their complete elimination from the equations describing 
the particles. This is easily done. One must simply solve for the motion 
of the oscillators before one integrates over the various variables x; for the 
particles. It is the integration over x; which tries to condense the past 
history into a single state function. This we wish to avoid. Of course, the 
result depends upon the initial and final states of the oscillator. If they are 
specified, the result is an equation for (xz |1|Yy/) like (38), but containing 
as a factor, besides exp(¢S/h) another functional G depending only on the 
coordinates describing the paths of the particles. 

We illustrate briefly how this is done in a very simple case. Suppose 
a particle, coordinate x(t), Lagrangian D(%, x) interacts with an oscillator, 
coordinate q(t), Lagrangian $(q? — w*q*), through a term 7(x, t)q(t) in the 
Lagrangian for the system. Here 7(a,t) is any function of the coordinate 
x(t) of the particle and the time.?? Suppose we desire the probability of a 
transition from a state at time ¢’, in which the particle’s wave function is 
wv and the oscillator is in energy level n, to a state at ¢’” with the particle 


22 The generalization to the case that y depends on the velocity, «, of the particle 
presents no problem. 
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in x, and oscillator in level m. This is the square of 


(aremlttbern)syasorsy = f+ f er lasdxir(as) 


x exp 5 (Sp + So + Sr) vx (£0) en (40) 


dro dqg — dxj—1 dqj-1 
z - dx dq; . (61) 


Here ¢,,(q) is the wave function for the oscillator in state n, S> is the action 


go 
ye 6 eines 
i=0 


calculated for the particle as though the oscillator were absent, 


P = é (Gii-G)" ew? , 
0— 2 7 q ‘itt 


1=0 


that of the oscillator alone, and 


(where y; = y(2;,t;)) is the action of interaction between the particle and 
the oscillator. The normalizing constant, a, for the oscillator is (27ei/h)~?. 
Now the exponential depends quadratically upon all the g;. Hence, the 
integrations over all the variables q; for 0 <i <j can easily be performed. 
One is integrating a sequence of Gaussian integrals. 

The result of these integrations is, writing T = t” — U¥, 
(QrihsinwT/w)~? exp i(Sp + Q(qj,q0))/h, where Q(q;, qo) turns out to be 
just the classical action for the forced harmonic oscillator (see reference 15)". 


Explicitly it is 


7) 
OG 40) = Se (cos wT)(q5 + 95) — 2440 

2 tt” 2 : t"” 

Bf y(eysin w(t eat + Bf 9(e)sinw(e" — at 
w Se Ww Sty 
2 tat 

aan =| i y(t)y(s) sin w(t” — t) sin w(s — t')dsdt| . 
Ww t! t! 


+ Editor’s note: This should read (see reference 21), i-e., Feynman’s doctoral thesis. 
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It has been written as though y(t) were a continuous function of time. The 
integrals really should be split into Riemann sums and the quantity (2, ti) 
substituted for y(¢;). Thus, Q depends on the coordinates of the particle 
at all times through the y(2;,¢;) and on that of the oscillator at times t’ 
and t” only. Thus, the quantity (61) becomes 


(xe Pm|L| We Pn) Sp+So0+Sr 


; rec) dx dx;-1 
= [ov [eee Gom exp (=) be (to) iets 7 dx; 


= (xt |Gmn| ve), 


which now contains the coordinates of the particle only, the quantity Gmn 
being given by 


Ginn = (2méh sin wT'/w)7? i; i yo", (ag) exp(4Q(4y,40)/) Pn (40)da; 440 - 


Proceeding in an analogous manner one finds that all of the oscillators 
of the electromagnetic field can be eliminated from a description of the 
motion of the charges. 


14. Statistical Mechanics: Spin and Relativity 


Problems in the theory of measurement and statistical quantum mechanics 
are often simplified when set up from the point of view described here. 
For example, the influence of a perturbing measuring instrument can be 
integrated out in principle as we did in detail for the oscillator. The sta- 
tistical density matrix has a fairly obvious and useful generalization. It 
results from considering the square of (38). It is an expression similar to 
(38) but containing integrations over two sets of variables dx; and dx’. The 
exponential is replaced by exp i(.S' — S’)/h, where S’ is the same function 
of the zi as S is of x;. It is required, for example, to describe the re- 
sult of the elimination of the field oscillators where, say, the final state of 
the oscillators is unspecified and one desires only the sum over all final 
states m. 

Spin may be included in a formal way. The Pauli spin equation can be 
obtained in this way: One replaces the vector potential interaction term 
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in S(@ia4, Xi), 
€ e€ 


5g xit1 — xi)» A(x) + —(ei41 — x4) - A(Ki41) 
Cc 2c 


arising from expression (13) by the expression 

So(O- (ig1 — 1))(@ - A(K:)) + (0 A(e41))(@ > (Xi41 — X:)). 
Here A is the vector potential, x;;1 and x; the vector positions of a particle 
at times ¢t;,, and ¢; and o is Pauli’s spin vector matrix. The quantity ® 
must now be expressed as |]; exp 1$(2;41,2;)/h for this differs from the 
exponential of the sum of S(x;41,2;). Thus, ® is now a spin matrix. 

The Klein Gordon relativistic equation can also be obtained formally 
by adding a fourth coordinate to specify a path. One considers a “path” as 
being specified by four functions «“)(r) of a parameter r. The parameter 
T now goes in steps € as the variable t went previously. The quantities 
a(t), 2) (t), x) (t) are the space coordinates of a particle and x (t) is 
a corresponding time. The Lagrangian used is 


do (de /dr)? + (e/c) (da /dr) Ay), 


where A,, is the 4-vector potential and the terms in the sum of js = 1, 2,3 are 
taken with reversed sign. If one seeks a wave function which depends upon 
m@ periodically, one can show this must satisfy the Klein Gordon equation. 
The Dirac equation results from a modification of the Lagrangian used for 
the Klein Gordon equation, which is analagous to the modification of the 
non-relativistic Lagrangian required for the Pauli equation. What results 
directly is the square of the usual Dirac operator. 

These results for spin and relativity are purely formal and add nothing 
to the understanding of these equations. There are other ways of obtaining 
the Dirac equation which offer some promise of giving a clearer physical 
interpretation to that important and beautiful equation. 

The author sincerely appreciates the helpful advice of Professor and 
Mrs. H. C. Corben and of Professor H. A. Bethe. He wishes to thank 
Professor J. A. Wheeler for very many discussions during the early states 
of the work. 
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Abstract 


Quantum mechanics was built up on a foundation of analogy with 
the Hamiltonian theory of classical mechanics. This is because the 
classical notion of canonical coordinates and momenta was found 
to be one with a very simple quantum analogue, as a result of 
which the whole of the classical Hamiltonian theory, which is just 
a structure built up on this notion, could be taken over in all its 
details into quantum mechanics. 

Now there is an alternative formulation for classical dynamics, 
provided by the Lagrangian. This requires one to work in terms 
of coordinates and velocities instead of coordinates and momenta. 
The two formulations are, of course, closely related, but there are 
reasons for believing that the Lagrangian one is the more funda- 
mental. 

In the first place the Lagrangian method allows one to collect 
together all the equations of motion and express them as the sta- 
tionary property of a certain action function. (This action function 
is just the time-integral of the Lagrangian.) There is no correspond- 
ing action principle in terms of the coordinates and momenta of the 
Hamiltonian theory. Secondly the Lagrangian method can easily be 
expressed relativistically, on account of the action function being a 
relativistic invariant; while the Hamiltonian method is essentially 
non-relativistic in form, since it marks out a particular time vari- 
able as the canonical conjugate of the Hamiltonian function. 

For these reasons it would seem desirable to take up the ques- 
tion of what corresponds in the quantum theory to the Lagrangian 
method of the classical theory. A little consideration shows, how- 


* This article was first published in Physikalische Zeitschrift der Sowjetunion, Band 3, 
Heft 1 (1933), pp. 64-72. 
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ever, that one cannot expect to be able to take over the classi- 
cal Lagrangian equations in any very direct way. These equations 
involve partial derivatives of the Lagrangian with respect to the 
coordinates and velocities and no meaning can be given to such 
derivatives in quantum mechanics. The only differentiation process 
that can be carried out with respect to the dynamical variables of 
quantum mechanics is that of forming Poisson brackets and this 
process leads to the Hamiltonian theory.! 

We must therefore seek our quantum Lagrangian theory in an 
indirect way. We must try to take over the ideas of the classical 
Lagrangian theory, not the equations of the classical Lagrangian 
theory. 


Contact Transformations 


Lagrangian theory is closely connected with the theory of contact transfor- 
mations. We shall therefore begin with a discussion of the analogy between 
classical and quantum contact transformations. Let the two sets of vari- 
ables be p,, g, and P,, Q,, (r = 1, 2,..., ) and suppose the q’s and Q’s 
to be all independent, so that any function of the dynamical variables can 
be expressed in terms of them. It is well known that in the classical theory 
the transformation equations for this case can be put in the form 


Os Os 


~ Ba BGr 


Pr 


where S$ is some function of the q’s and Q’s. 
In the quantum theory we may take a representation in which the q’s 
are diagonal, and a second representation in which the Q’s are diagonal. 


1 Processes of partial differentiation with respect to matrices have been given by Born, 
Heisenberg and Jordan (ZS. f. Physik 35, 561, 1926) but these processes do not give 
us means of differentiation with respect to dynamical variables, since they are not in- 
dependent of the representation chosen. As an example of the difficulties involved in 
differentiation with respect to quantum dynamical variables, consider the three compo- 
nents of an angular momentum, satisfying 


MaeMy — MyMz = thm; . 


We have here mz expressed explicitly as a function of mz and my, but we can give no 
meaning to its partial derivative with respect to mz or my. 
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There will be a transformation function (q’|Q’) connecting the two rep- 
resentations. We shall now show that this transformation function is the 
quantum analogue of e'9/", 

If a is any function of the dynamical variables in the quantum theory, 
it will have a “mixed” representative (q’|a|Q’), which may be defined in 
terms of either of the usual representatives (q’|a|q”), (Q’|a|Q”) by 


(q/|alQ’) = i (q/o|q”)dq""(q'1Q’) = / (d/|Q")dQ"(Q"|a|Q!).. 


From the first of these definitions we obtain 


(q'|ar|Q'1) = @-(¢'1Q") (2) 

(a prlQ) = ihe 12) (3) 
and from the second 

(a'lQr1@')) = @.(a'12) (4) 

(a |PeIQ')) = thse a'Q). (5) 


Note the difference in sign in (3) and (5). 
Equations (2) and (4) may be generalised as follows. Let f(q) be any 
function of the q’s and g(Q) any function of the Q’s. Then 


(d|F(@9(Q)1@") = / i (a'|F(a)la")aa""(a"1Q")4Q" (Q"g(Q)|@") 
= f(7/)g(Q'V(7|Q’). 


Further, if f.(q) and gz(Q), (k = 1, 2,..., m) denote two sets of functions 
of the q’s and Q’s respectively, 


(qe fe(Qgn(Q)1Q") = Ux fe(q’)gn(Q") - (g'|Q") - 


Thus if a is any function of the dynamical variables and we suppose it to be 


‘ 


expressed as a function a(qQ) of the q’s and Q’s in a “well-ordered” way, 
that is, so that it consists of a sum of terms of the form f(q)g(Q), we shall 


have 


(la(gQ)1Q') = a(7Q‘)(7'1Q"). (6) 
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This is a rather remarkable equation, giving us a connection between a(q¢Q), 
which is a function of operators, and a(q’Q’), which is a function of numer- 
ical variables. 


Let us apply this result for a = p,. Putting 


('|Q’) = eiU/h ‘ (7) 
where U is a new function of the q’’s and Q’’s we get from (3) 
OU qd'Q’ 
(d\priQ’) = MEP ari, 
qr 
By comparing this with (6), we obtain 
_ JU (aQ) 
Pr Oar 


as an equation between operators or dynamical variables, which holds pro- 
vided 0U/0q, is well-ordered. Similarly, by applying the results (6) for 
a = Pr and using (5), we get 
_ 0U (GQ) 

OQ, 
provided 0U/0Q,. is well-ordered. These equations are of the same form 


as (1) and show that the U defined by (7) is the analogue of the classical 
function S, which is what we had to prove. 


P= 


Incidentally, we have obtained another theorem at the same time, 
namely that Eqs. (1) hold also in the quantum theory provided the right- 
hand sides are suitably interpreted, the variables being treated classically 
for the purpose of the differentiations and the derivatives being then well- 
ordered. This theorem has been previously proved by Jordan by a different 
method.? 


The Lagrangian and the Action Principle 
The equations of motion of the classical theory cause the dynamical 
variables to vary in such a way that their values q, p, at any time ¢ 


are connected with their values gr, pr at any other time T by a contact 


2 Jordan, ZS. f. Phys. 38, 513, 1926. 
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transformation, which may be put into the form (1) with q, p = q, pe; Q, 
P =gqr, pr and S equal to the time integral of the Lagrangian over the 
range T to t. In the quantum theory the q@, p; will still be connected with 
the gr, pr by a contact transformation and there will be a transformation 
function (q|¢r) connecting the two representations in which the q, and the 
qr are diagonal respectively. The work of the preceding section now shows 
that 


t 
(q|gr) corresponds to exp i f Lat ’ (8) 
T 


where L is the Lagrangian. If we take T to differ only infinitely little from 
t, we get the result 


(qt+ae|Gz) Corresponds to exp [iL dt/h] . (9) 


The transformation functions in (8) and (9) are very fundamental things 
in the quantum theory and it is satisfactory to find that they have their 
classical analogues, expressible simply in terms of the Lagrangian. We 
have here the natural extension of the well-known result that the phase of 
the wave function corresponds to Hamilton’s principle function in classical 
theory. The analogy (9) suggests that we ought to consider the classical 
Lagrangian, not as a function of the coordinates and velocities, but rather 
as a function of the coordinates at time ¢ and the coordinates at time t+ dt. 

For simplicity in the further discussion in this section we shall take the 
case of a single degree of freedom, although the argument applies also to 
the general case. We shall use the notation 


exp [ [ “Lt = A(tT), 


so that A(tT) is the classical analogue of (q|¢r). 
Suppose we divide up the time interval T — t into a large number of 


small sections T — t1, t) — ta,..., tn—-1 — tm, tm — t by the introduction 
of a sequence of intermediate times t1, ta,..., tm. Then 
A(tT) = A(ttm)A(tmtm—1)-..A(teti)A(tT) . (10) 


Now in the quantum theory we have 


(aslar) = f (ael4m)€dradr|im—1)don—1 «+ (galen)dan anlar) (11) 
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where g, denotes q at the intermediate time t,, (kK = 1, 2,..., m). 
Equation (11) at first sight does not seem to correspond properly to 
Eq. (10), since on the right-hand side of (11) we must integrate after 
doing the multiplication while on the right-hand side of (10) there is 
no integration. 

Let us examine this discrepancy by seeing what becomes of (11) when 
we regard ¢ as extremely small. From the results (8) and (9) we see that 
the integrand in (11) must be of the form e‘”/” where F is a function of 
qT; V1; 92---; Gm; @ Which remains finite as h tends to zero. Let us now 
picture one of the intermediate q’s, say qx, aS varying continuously while the 
others are fixed. Owing to the smallness of h, we shall then in general have 
F/h varying extremely rapidly. This means that e’” /h will vary periodically 
with a vary high frequency about the value zero, as a result of which its 
integral will be practically zero. The only important part in the domain of 
integration of q, is thus that for which a comparatively large variation in qx, 
produces only a very small variation in F’. This part is the neighbourhood 
of a point for which F' is stationary with respect to small variations in qx. 

We can apply this argument to each of the variables of integration in 
the right-hand side of (11) and otbain the result that the only important 
part in the domain of integration is that for which F is stationary for small 
variations in all the intermediate q’s. But, by applying (8) to each of the 


small time sections, we see that F' has for its classical analogue 


t bits te ty t 
, Lat+ | Latte + [ Lat+ [ Lat= | Ldt, 
t t ty T T 


™m m-1 


which is just the action function which classical mechanics requires to be 
stationary for small variations in all the intermediate q’s. This shows the 
way in which Eq. (11) goes over into classical results when h becomes 
extremely small. 

We now return to the general case when h is not small. We see that, for 
comparison with the quantum theory, Eq. (10) must be interpreted in the 
following way. Each of the quantities A must be considered as a function 
of the q’s at the two times to which it refers. The right-hand side is then a 
function, not only of gr and q@, but also of qi, g2,---, Gm, and in order to 
get from it a function of gr and q only, which we can equate to the left- 
hand side, we must substitute for qi, g2,---, Gm their values given by the 
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action principle. This process of substitution for the intermediate q’s then 
corresponds to the process of integration over all values of these q’s in (11). 

Equation (11) contains the quantum analogue of the action principle, as 
may be seen more explicitly from the following argument. From Eq. (11) 
we can extract the statement (a rather trivial one) that, if we take specified 
values for gr and q, then the importance of our considering any set of 
values for the intermediate q’s is determined by the importance of this set 
of values in the integration on the right-hand side of (11). If we now make h 
tend to zero, this statement goes over into the classical statement that, if we 
take specified values for gr and q, then the importance of our considering 
any set of values for the intermediate q’s is zero unless these values make 
the action function stationary. This statement is one way of formulating 


the classical action principle. 


Application to Field Dynamics 


We may treat the problem of a vibrating medium in the classical theory by 
Lagrangian methods which form a natural generalisation of those for par- 
ticles. We choose as our coordinates suitable field quantities or potentials. 
Each coordinate is then a function of the four space-time variables 2, y, z, 
t, corresponding to the fact that in particle theory it is a function of just 
the one variable t. Thus the one independent variable t of particle theory 
is to be generalised to four independent variables x, y, z, t.? 

We introduce at each point of space-time a Lagrangian density, which 
must be a function of the coordinates and their first derivatives with respect 
to xz, y, z and t, corresponding to the Lagrangian in particle theory being 
a function of coordinates and velocities. The integral of the Lagrangian 
density over any (four-dimensional) region of space-time must then be 
stationary for all small variations of the coordinates inside the region, pro- 
vided the coordinates on the boundary remain invariant. 


3 It is customary in field dynamics to regard the values of a field quantity for two 
different values of (x, y, z) but the same value of t as two different coordinates, instead of 
as two values of the same coordinate for two different points in the domain of independent 
variables, and in this way to keep to the idea of a single independent variable t. This point 
of view is necessary for the Hamiltonian treatment, but for the Lagrangian treatment the 
point of view adopted in the text seems preferable on account of its greater space-time 
symmetry. 
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It is now easy to see what the quantum analogue of all this must be. If 
S denotes the integral of the classical Lagrangian density over a particular 
region of space-time, we should expect there to be a quantum analogue 
of e*5/" corresponding to the (q|qr) of particle theory. This (q|qr) is a 
function of the values of the coordinates at the ends of the time interval to 
which it refers and so we should expect the quantum analogue of e**/" to 
be a function (really a functional) of the values of the coordinates on the 
boundary of the space-time region. This quantum analogue will be a sort of 
“generalized transformation function”. It cannot in general be interpreted, 
like (q|qr), as giving a transformation between one set of dynamical vari- 
ables and another, but it is a four-dimensional generalization of (q:|¢r) in 
the following sense. 


Corresponding to the composition law for (q|¢r) 


Cre / iehanean, (12) 


the generalized transformation function (g.t.f.) will have the following com- 
position law. Take a given region of space-time and divide it up into two 
parts. Then the g.t.f. for the whole region will equal the product of the 
g.t.f.’s for the two parts, integrated over all values for the coordinates on 
the common boundary of the two parts. 

Repeated application of (12) gives us (11) and repeated application of 
the corresponding law for g.t.f.’s will enable us in a similar way to connect 
the g.t.f. for any region with the g.t.f.’s for the very small sub-regions 
into which that region may be divided. This connection will contain the 
quantum analogue of the action principle applied to fields. 

The square of the modulus of the transformation function (q:\gr) can 
be interpreted as the probability of an observation of the coordinates at 
the later time ¢ giving the result q for a state for which an observation 
of the coordinates at the earlier time T is certain to give the result qr. 
A corresponding meaning for the square of the modulus of the g.t.f. will 
exist only when the g.t.f. refers to a region of space-time bounded by 
two separate (three-dimensional) surfaces, each extending to infinity in the 
space directions and lying entirely outside any light-cone having its vertex 
on the surface. The square of the modulus of the g.t.f. then gives the 
probability of the coordinates having specified values at all points on the 
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later surface for a state for which they are given to have definite values at 
all points on the earlier surface. The g.t.f. may in this case be considered 
as a transformation function connecting the values of the coordinates and 
momenta on one of the surfaces with their values on the other. 

We can alternatively consider |(q|¢r)|? as giving the relative a priori 
probability of any state yielding the results gr and q when observations of 
the q’s are made at time T and at time ¢ (account being taken of the fact that 
the earlier observation will alter the state and affect the later observation). 
Correspondingly we can consider the square of the modulus of the g.t.f. for 
any space-time region as giving the relative a priori probability of specified 
results being obtained when observations are made of the coordinates at 
all points on the boundary. This interpretation is more general than the 
preceding one, since it does not require a restriction on the shape of the 


space-time region. 
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